Benefits of sharing


A swift and effective response to emerging infectious diseases demands that researchers have ready 
access to the latest data on the pathogens responsible. There is still a long way to go to ensure this.


Another year, another virus. As the Ebola-virus epidemic recedes,
Zika dominates the news. The virus, which usually causes only
mild symptoms, has been linked to a reported increase in the
number of babies born in Brazil with microcephaly (abnormally
small heads and brains). The possible implications of this for pregnant
women demand a rapid and evidence-based approach.

The immediate priorities are to gather epidemiological and clinical
data to establish whether the apparent spike in cases is real and, if so, to
what extent the Zika virus is involved (see page 142). And researchers
elsewhere must have full access to all of this information as soon as it
is available.

Conventional scientific publishing, based on rounds of peer review,
can be too slow to rapidly disseminate research findings during a
public-health emergency. One solution is the immediate release of data to
public databases and subsequent publication of peer-reviewed analysis.
As we have said before, prior release of data and analysis to public
databases, preprint servers and forums will not jeopardize consideration
of a submission to Nature journals. And Nature journals will make all
papers relating to Zika virus free to access until further notice.

Already, there have been promising moves to make data on
microcephaly, and on the epidemic of Zika in the Americas, readily
accessible. The World Health Organization (WHO) has announced a
‘Zika Open’ initiative, in which all relevant submissions to its Bulletin
will be posted online within 24 hours.

In this issue, we publish research that demonstrates the need for
rapid data sharing during outbreaks. As detailed on page 228,
genome-sequencing technology has advanced to the point at which the whole
genome of a virus sample can be sequenced in the field within 24 hours
by means of a portable sequencing system. Previously, such sequencing
during the course of an epidemic has been slower because it has relied
on sending samples to a laboratory. Although this method still faces
technical challenges, it should prove a crucial tool in epidemiological
research. It offers the potential to rapidly trace how a disease is being
passed from person to person, and so help to guide authorities to direct
resources that can break these transmission chains.

But the full potential of this advance can be realized only if scientists
can access sequence data obtained from samples taken at different
times and places during an outbreak. Pathogen genome sequences are
most useful when studied alongside epidemiological and clinical data.

Many scientists across the research fields, but notably in genomics,
have been enthusiastic champions of early data release.
Infectious-disease researchers make use of public forums and databases such as
virological.org and GISAID (Global Initiative on Sharing All Influenza
Data). The former resource is becoming a major platform for sharing
and discussing preliminary analyses of data and it already hosts a
genomic analysis of the Zika virus. And, launched just over four years
ago, ISARIC (International Severe Acute Respiratory and Emerging
Infection Consortium) works with clinicians and epidemiologists to
put in place pre-agreed protocols and data-sharing processes that can
then quickly be adapted to a new situation.

Almost one year ago, this journal published a Comment article 
that called for the immediate sharing of outbreak data, a policy that 
the authors themselves had adopted during their early sequencing of 
Ebola-virus genomes in the 2014-15 outbreak (N. L. Yozwiak et al. 
Nature 518, 477-479; 2015). They also called on the WHO to convene a
meeting to develop guidance for data sharing during infectious-disease 
outbreaks. Such a meeting took place last September. It was attended by 
government representatives, public-health agencies, scientists, research 
funders, ethicists and publishers. All acknowledged that pathogen 
sequence information collected during a public-health emergency is of 
greatest value when released openly, in as close to real time as possible. 

A statement released by the WHO after the meeting emphasized 
the “fundamental moral obligation” of every researcher who generates 
information related to a public-health emergency to share their prelimi- 
nary results once these have undergone quality control. Representatives 
from leading biomedical journals also unequivocally emphasized that 
disclosure of such information would not prejudice journal publication. 

Scientists still face challenges to swift data sharing. For example, as 
was seen during the emergence of the H5N1 and H7N9 avian influenza 
viruses and Middle East respiratory syndrome coronavirus, rapid data 
sharing can be hampered by a lack of international rules that govern 
how credit and rights (including intellectual property) should be fairly 
distributed among scientists and authorities in the countries where 
outbreaks occur as well as researchers based elsewhere. 

To play our part in driving the shift towards fast data sharing during 
public-health emergencies, Nature journals will now encourage authors 
who haven’t already deposited their relevant sequence information in
public archives to do so on submission.


A good precedent 


Jimmy Carter’s efforts to eradicate Guinea 
worm should be applauded. 


When former US President Jimmy Carter wanted Ghana to
take his goal to eradicate the Guinea worm seriously, he
came up with a novel threat. Carter told the country’s president
that he would try to get the parasitic disease’s name changed to
Ghana worm. “There isn’t a Guinea worm left in Ghana now,” Carter
told journalists with a grin in London last week.
Carter might just be the United States’ most productive ex-president. 
In the 35 years since he left the White House, the peanut farmer from 
Georgia has brokered peace deals, helped to see off despots such as
Panama's Manuel Noriega and still found time to spend one week each 
year building houses for the needy. 

But Carter’s most lasting accomplishment is likely to be disease 
eradication. Last year, just 22 people in 4 countries in sub-Saharan 
Africa had Guinea-worm disease, compared with the 3.5 million who 
suffered the parasitic infection in one year in the mid-1980s, when the
non-profit Carter Center in Atlanta, Georgia, began to lead a global 
campaign to wipe out the disease. 

The former president spoke last week at a meeting to drum up 
support for Guinea-worm-eradication efforts, barely showing his 
91 years or his fight with cancer (he revealed last year that he has 
metastatic melanoma). He has said that he wants the last Guinea worm 
to leave the world before he does, and he may also live long enough 
to see the back of another global menace: the three-decade effort to 
eradicate polio is nearing completion. The virus is now present in only 
Afghanistan and Pakistan, which together recorded an all-time low 
of 73 cases last year. 

Neither disease will go without a final struggle. Violence and political 
instability threaten both efforts, and a Guinea-worm epidemic in dogs in 
Chad demonstrates the unpredictability of disease eradication. But both 
campaigns have come too far and cost too much to be left incomplete, 
and world leaders should provide the resources needed to finish the jobs. 

What next? Carter is among those who would like to see other dis- 
eases exterminated. His centre launched the International Task Force 
for Disease Eradication, which has drawn up a list of candidates, includ- 
ing measles, mumps and lymphatic filariasis (a parasitic infection also 
known as elephantiasis). “We’re ready and eager to go on several diseases,”
Carter said. He even needled the World Health Organization for
its sluggishness in adopting Guinea-worm eradication as an official goal, 
and its reluctance to target other diseases for eradication. 

Carter might be right on Guinea worm, but given the tools available 
to twenty-first-century science and medicine, is an all-out assault to 
eradicate disease still the best way to go? 

Putting the fight against a pathogen on a pedestal can warp efforts
to tackle other diseases and develop health systems. Amid regular
door-to-door campaigns to deliver oral polio vaccines, northern
Nigeria had (and still has) low rates of routine childhood vaccination
for diseases such as measles and diphtheria. (Officials have argued that
future eradication campaigns could do more to improve health systems,
for instance by starting eradication efforts in a particular country
only once routine immunization coverage reaches a certain threshold.)

The Guinea-worm and polio campaigns have been decidedly old-school
in approach. The main tool against polio has been a decades-old
oral vaccine (although a switch to an injectable vaccine is under way).
There is no vaccine or treatment for Guinea worm, and eradication
efforts have focused on community-based approaches to change
behaviour, such as teaching people to filter all drinking water, to
avoid recontaminating water sources, and to report cases to health
authorities.

Future eradication campaigns have obvious appeal. Although they 
are costly, success spares countries from the economic burdens of a 
disease. Guinea-worm efforts have also helped to provide clean drink- 
ing water to millions of people in some of the world’s poorest and most 
remote areas. And eradication is an appealing target that can court 
donors who might not otherwise support public health. 

Future eradication campaigns (and other public-health efforts) 
should find inspiration in the fact that Guinea-worm eradication has 
been focused on behavioural change, but they are likely to have other 
approaches to choose from. One of the most promising is gene-drive 
technology, which allows a gene that is harmful to a pathogen or vector 
to rapidly spread through a sexually reproducing population. 

Last year, two teams developed experimental gene drives that made 
mosquito populations infertile or resistant to malaria. It will be years 
before the technology is ready for field trials. But it is one reason that 
people are beginning to talk seriously again about malaria eradication, 
after failed efforts in the 1950s and 1960s. 

Gene drives could also be applied to other vector-borne infections, 
including tick-borne infections such as the bacteria that cause Lyme 
disease. President Carter’s hit list may yet grow longer.




Outside the bubble 


Governments must stop proposing solutions and 
invest in large-scale removal of carbon dioxide. 


Every generation throws a different hero up the pop charts,
sang musician Paul Simon. And every political cycle, it seems,
promotes its own signature solution to the problem of how
to curb and prevent climate change. This year’s answer is bioenergy
with carbon capture and storage: a plan on a colossal scale to grow
grass and trees on an area half the size of the United States, harvest
and ship this biomass to power stations, burn it and then trap the
carbon dioxide from the exhaust gases. The greenhouse gas would
then be piped underground and stored indefinitely. The scheme’s
acronym, BECCS, barely does its complexity justice.

Perhaps this is the first you have heard of BECCS. That wouldn’t be
a surprise. It is something of an overnight sensation: the boy band
launched to number one on the back of a reality television show, rather
than the grizzled rockers who earned their fame after years of concerts
attended by three people and a dog. Yet at the Paris climate talks
late last year, which were widely acclaimed as a triumph, the BECCS
scheme was quietly installed as the world’s Plan A.

That’s because it comes with a very catchy tune that politicians can’t
get out of their heads. BECCS solves the problem of future carbon emissions
and cleans up the past. The plants suck CO₂ from the atmosphere,
which ends up safely underground. We get the benefit of burning them
to generate electricity, the world gets to keep its power infrastructure, 
and the atmosphere experiences what BECCS enthusiasts call negative 
emissions. Rather brilliantly, the more energy the scheme produces, the 
more the planet edges away from dangerous levels of global warming. 

What’s not to like? Well, in a World View article published after the
Paris talks, climate scientist Kevin Anderson compared BECCS to a fairy
godmother, conjured up to wish away reality in a puff of optimism (see
Nature 528, 437; 2015). And in a Comment piece on page 153, environmental
scientist Phil Williamson takes a hard look at some of the
questions that BECCS seems to pose, and finds few answers.

How would we preserve forests and grasslands, faced with such a 
demand for energy crops? How much carbon would be released during 
the agricultural stage? How much water will we need, and where will 
we get it? How much will it cost to build the network of compressors, 
pipes, pumps and tanks that will be needed to liquefy and transport the 
separated CO₂? Can it even be separated at a sensible cost?

Recent years have seen a series of solutions to global warming thrown 
up the political agenda. From biofuels and carbon offsets to ocean ferti- 
lization and conventional carbon capture, each has had its moment in 
the sun, only to be replaced by something younger with a new sound. 

BECCS may yet prove to have staying power. But to avoid another 
one-hit climate wonder, governments must spend the money to do 
the groundwork — as Williamson says — and 
to answer those questions and plenty more. 
Politics is the art of the possible. But serious 
action on climate change must be based on the 
science of the probable.


Change the system to
halt harassment

Universities and their senior staff must do more to deter, detect and
punish all forms of inappropriate behaviour, says Joan Schmelz.

Sexual harassment is a vile plague on science. I know this because
I am asked regularly for help and advice by women who have
been harassed. And the problem has been laid out for the broader
research community in a series of recent cases. The US National Science
Foundation, among other organizations, has reminded universities of
their obligations to investigate allegations of harassment and to punish
perpetrators accordingly. Yet the system continues to protect the
reputation of harassers and the institutions that employ them. It is not
enough for affected individuals to speak up. They must also know that
their complaint will be dealt with properly. Here are some suggestions
that could improve this situation.

Everyone should know the law. Where sexual harassment is illegal,
the bar for inappropriate behaviour is lower than some might think. In
the United States, for example, harassment constitutes behaviour that
is “sufficiently severe, persistent, or pervasive to
limit a student’s ability to participate in or benefit
from an educational program or activity, or to create
a hostile or abusive educational environment,”
according to the Department of Education.

Students, members of faculty and other university
staff sometimes misinterpret these words
to mean that only behaviours that are severe, persistent and
pervasive define a hostile environment. They do
not understand that these terms can stand alone.
Behaviour that many people might regard as being
innocuous, but that is persistent, targeted or
timed to have a negative effect, can limit the
ability of a student to learn and is unacceptable.

Every university should have an office to
which members of its community can talk
anonymously about harassment: an ‘Office of Good Advice’. It must
be fundamentally separate from the affirmative-action office, legal
team and police force of the university, all of which are obliged to
report apparent breaches of the law.

The Office of Good Advice should be well known to all. It should be
staffed by professionals, and it should be the first result that is returned
in a web search for ‘sexual harassment’. Anyone on campus who needs
to talk about harassment should know which university employees will
report incidents and which can keep such reports confidential.

Every university needs an Office of Good Advice. This is so that students
need not fear that they will be pressed to make a formal, legally
viable report at offices with legal responsibility, and because staff who
are released from legal obligations are more likely to be able to listen
with objectivity and sensitivity. Those who have been harassed might
later report that they were asked intimidating and
inappropriate questions that seem to undermine
the validity of their complaint, such as: “Were
you drinking?” or “Are you unhappy with your
grade in his course?” No matter how good their
intentions, members of staff in offices who must uphold the law cannot
help but be influenced by this responsibility and their knowledge of the
requirements of an investigation.

The Office of Good Advice needs to keep track of multiple complaints 
against the same harasser. A university, by its very nature, has a fluid 
population of students and postdoctoral researchers. A way in which 
serial harassers can continue is by targeting one individual at a time, 
perhaps in a different class or a year apart, so that their targets never 
meet and are unaware of each other. 

At present, reports of sexual harassment are filed under the name of 
the complainant, in part to protect the identity of the alleged harasser. 
One small step in the right direction would be to file the report under 
the name of the harasser. There is reason to be cautious, but the law — at 
least in the United States — makes it clear that universities should have 
the flexibility to investigate allegations thoroughly. 

Department heads must step up. They have the 
power to change the culture and to create an envi- 
ronment in which everyone feels able to do their 
best work. In fact, it is a fundamental component 
of their job description. Although they should 
embrace their responsibility, too many senior staff 
choose not to. They convince themselves that they 
are too busy: after all, they have reports to file, 
budgets to balance, proposals to write, postdocs 
to hire and students to supervise. 

These senior figures should make themselves 
and their staff familiar with the local anti-harass- 
ment policy and available resources. They should 
post the policy and information on how to access 
the resources in a prominent place. Such actions 
clearly demonstrate that the department considers the issue to be impor- 
tant and takes it seriously. 

Department-wide meetings on the topic of sexual harassment should 
also be convened, with help from on-campus resources. The chair of the 
meeting and the university hierarchy should affirm that harassment will 
not be tolerated. The existing policy and resources should be discussed. 
Everyone should be encouraged to share their views on the workplace 
culture — with the group, in smaller, targeted meetings or anonymously. 

What can everyone else do? We can project that we are safe to come
to for advice, simply by interjecting with “I disagree”, when inappropriate
or sexist comments arise in group discussion. This small phrase can
go a long way. And if someone comes to you with a problem, do not
immediately question their story. Instead, listen respectfully. You can
be certain that if action is going to be taken, the appropriate questions
will be asked in due course.


Joan Schmelz works for the Universities Space Research Association 
and is deputy director of the Arecibo Observatory in Puerto Rico. 
e-mail: jschmelz@usra.edu 
Selections from the
scientific literature


GENOMICS
Early risers share
genetic signature


Three different teams have
identified genetic variants that
are associated with being a
morning person.

Youna Hu, David Hinds
and their team at 23andMe,
a personal-genetics firm in
Mountain View, California,
analysed genome data from
nearly 90,000 customers who
stated in a survey whether
they were a morning person
or a night owl. Teams led
by Michael Weedon at the
University of Exeter Medical
School, UK, and Richa Saxena
at the Massachusetts General
Hospital in Boston conducted
similar analyses on the DNA
of more than 100,000 people
who participated in the UK
Biobank project.

Each team found a dozen or
more variants that were linked
to a person’s preference for
sleep times, and many were
in or near genes that help to
regulate the body’s circadian
clock.

Nature Commun. 7, 10448 (2016);
preprints at bioRxiv http://doi.org/bcb7 (2016);
http://doi.org/bcb8 (2016)


More carbon from
planted forests


Planting trees will not
necessarily slow climate
change.

Kim Naudts at the
Laboratory of Climate
Science and Environment in
Gif-sur-Yvette, France, and
her colleagues paired a history
of land use in Europe with
land and atmospheric models
to study the effect of forests
on the climate. Although
the continent’s forests have
expanded by 10%, timber
harvesting and a shift to more
commercially valuable trees,
mainly the fast-growing
conifers, have resulted in the
release of more than 3 billion
tonnes of carbon into the
atmosphere since 1750.

The change from deciduous
trees to darker-leaved conifers
contributed to a rise of 0.12 °C
in local surface temperatures.
Science 351, 597-600 (2016)


PARASITOLOGY
Honeybee virus spread by human activity


The commercial trade in honeybees has helped to
spread a deadly bee virus around the globe over
the past century.

Deformed wing virus reduces the winter
survival of European honeybees (Apis mellifera),
and could be a factor in the large colony losses
seen in some parts of the world. To find out how
the virus became pandemic, Lena Wilfert at the
University of Exeter, UK, and her colleagues
analysed the virus’s genome to reconstruct its
evolutionary and geographical history. The team
found that the virus has spread from Europe
in the past century, in part because of the trade
in commercial colonies. New carriers of the
virus also contributed: the Varroa mite (Varroa
destructor) began to infect European honeybees
about 50 years ago, when this pandemic started.

Tighter controls on commercial colonies are
needed to slow the virus’s spread, the authors say.
Science 351, 594-597 (2016)


Brain cells in wells
make amyloid


Many different types of brain
cell can secrete high
levels of the peptide
amyloid-β, which forms the
brain plaques associated with
Alzheimer’s disease.

To detect molecules from
single brain cells, Tracy
Young-Pearse at Harvard
Medical School in Boston,
Massachusetts, and her
colleagues designed arrays of
nanometre-sized wells. They
took cells from healthy people
and from individuals with
familial Alzheimer’s disease,
reprogrammed them into stem
cells and used them to derive
a variety of brain-cell types,
which they then placed in
the wells. The authors added
antibodies that would detect
specific molecules.

Subsets of the brain
cells secreted amyloid-β
at different rates, and even
non-neuronal cells such as
astrocytes produced high
levels. More of the cells
derived from people with
Alzheimer’s disease secreted
large amounts of the peptide
compared with cells from
healthy individuals.

J. Neurosci. 36, 1730-1746 (2016)


NEUROSCIENCE
Molecule protects 
ageing neurons 


A small molecule reverses the 
neuronal deterioration that is 
typically found in middle-aged 
animals. 

Compared with the neurons 
of young animals, nerve cells 
in middle-aged rats generally 
have fewer branches. Their
connections, or synapses,
also seem to be less able to 
strengthen with repeated 
activation — changes that 
could help to explain declines 
in learning and memory over 
time. Christine Gall, Gary 
Lynch and their colleagues at 
the University of California, 
Irvine, preserved neuronal 
branching and boosted 
synapse strengthening in 

rats by treating them with a 
molecule that enhanced the 
activity of the AMPA-type 
glutamate receptor. This 
increased excitatory brain 
signalling. Animals treated 
for three months also showed 
improvements in some tests of 
long-term memory. 

The authors suggest that the 
mechanism could one day be 
used to protect against age- 
related neuronal decline. 

J. Neurosci. 36, 1636-1646 
(2016) 


PALAEONTOLOGY 


Mammal with a
dinosaur nose 


A wildebeest-like creature that 
lived in Africa 70,000 years 
ago had a trumpet-shaped 
nasal passage, similar in shape 
to those of some dinosaurs. 
Rusingoryx atopocranion 
was previously known from 
incomplete specimens, but a 
team led by Haley O’Brien at 
Ohio University in Athens and 
Tyler Faith at the University 
of Queensland in Brisbane, 
Australia, have discovered
six more-complete skulls on 
an island in Lake Victoria 
in Kenya (one of the skulls 
is pictured). X-ray scans 
revealed large, hollow nasal 
passages that extended out 
from the top of the animals’
heads. Known as nasal domes, 


these have previously been 
seen only in some hadrosaurs 
that lived around 65 million 
years ago — a rare example of 
how similarities in behaviour 
and environment can spur the 
evolution of a common feature 
in distantly related animals. 
Acoustic modelling 
suggested that the structures 
allowed R. atopocranion 
to vocalize at very low 
frequencies, possibly helping 
herds to communicate without 
alerting predators, the authors 
propose. 
Curr. Biol. http://doi.org/bedn 
(2016) 


Languages have 
common structure 


Many languages share a 
universal semantic structure 
that is independent of their 
speakers’ culture, environment 
or how closely the languages 
are related. 

Hyejin Youn and Tanmoy 
Bhattacharya at the Santa 
Fe Institute in New Mexico 
and their colleagues studied 
the words for 22 universal
concepts, such as ‘Sun’ and
‘water’, across 81 languages,
and identified which words 
had multiple meanings. They 
linked related words and 
meanings together to form 
network maps, and found 
common patterns in the way 
that different meanings were 
connected to each other. For 
example, words for ‘sea’ and
‘salt’ were closer to each other
than to words for ‘Sun’, and this
structure was preserved across 
all the languages. 

This method could help 
to reveal concepts that are 
universal features of human 
cognition and language use, 
the authors say. 
Proc. Natl Acad. Sci. USA 
http://doi.org/bb94 (2016) 


Better battery
with more juice


Researchers have made a
battery that can both hold high
energy and deliver high power.

Batteries typically can hold
a lot of energy but discharge it
at low rates. To make a material
capable of both high energy
and high power, a team led
by Yan Yu at the University
of Science and Technology of
China in Hefei placed sodium
vanadium triphosphate
(Na₃V₂(PO₄)₃) inside a mixture
of reduced graphene oxide and
carbon nanotubes to improve
the sodium compound’s
conductivity. They deposited
the material onto a
current-collecting stainless-steel
support, avoiding the use of
additives that would reduce the
performance of the device.

The team made a battery
with the porous material
and showed that the device
could deliver high levels of
energy at a high rate. It also
kept 96% of its capacity even
after being charged and
discharged 2,000 times. The
approach could be used with
other electrode materials, the
authors say.

Adv. Mater. http://doi.org/f3kvx3
(2016)


SOCIAL SELECTION
Popular topics
on social media


‘A field of crop scientists’


When cancer researcher Ritankar Majumdar published a
blog post about the collective names of doctors and scientists
on 2 February (go.nature.com/fzcnqt), he had no idea he
would inspire #scientistherdnames. Hundreds of tweets
with the hashtag, including “a cloud of data scientists” and
“a nucleus of physicists”, soon dominated scientists’ Twitter
streams. Some tweets reflected the angst
experienced in the lab. Erol Akcay, a
theoretical biologist at the University of
Pennsylvania in Philadelphia, tweeted:
“A grant of silent sobs?”


Ocean plastic
hurts oysters


Oysters that consume the small
pieces of plastic littering the
world’s oceans produce fewer
and less-robust offspring than
their unexposed counterparts.

Millions of tonnes of plastic
end up in the oceans every
year, and tiny ‘microplastic’
fragments (those smaller
than 5 millimetres in
diameter) could be harming
marine life. Arnaud Huvet at
the French marine-research
agency IFREMER in Plouzané
and his colleagues placed
Pacific oysters (Crassostrea
gigas; pictured) in water
laced with micrometre-sized
polystyrene spheres, at levels
estimated to be similar to
those where oysters live in
the wild. After two months,
oysters exposed to the plastic
produced fewer and smaller
egg cells, less-mobile sperm
and fewer offspring than did
animals raised in water without
the plastic. The offspring also
grew more slowly.

Ingested plastic might
be disturbing the oysters’
digestion and releasing
endocrine-disrupting
chemicals, which affect
reproductive systems, the
authors say.
Proc. Natl Acad. Sci. USA
http://doi.org/bcdm (2016)
SEVEN DAYS


Zika funding boost 
US President Barack Obama
requested US$1.8 billion in
emergency funding to fight
Zika virus, the White House
announced on 8 February.
The mosquito-transmitted
virus might be linked to
head malformations called
microcephaly in newborn
infants, and has mostly
affected countries in South
America. The World Health
Organization has declared
the rise in microcephaly a
global health emergency. The
president’s proposal is part
of his 9 February US budget
request, and would give around
$1.5 billion to the Department
of Health and Human Services,
including $828 million for
the US Centers for Disease
Control and Prevention to
research the virus, and about
$200 million to develop a
vaccine. The US Agency for
International Development will
get $335 million.


Copenhagen cuts 


Government budget cuts in
Denmark have forced the
University of Copenhagen
to slash staff numbers. More
than 500 employees will lose
their jobs, one-quarter of
them researchers, according
to a 4 February news release
from the university. Voluntary
redundancies and discontinued
contracts will make up 323 of
the job losses, and 209 staff
can expect to be laid off. The
university will also reduce its
PhD intake by 10%. The job
losses come after the Danish
government cut funding for
research by 1.4 billion kroner
(US$210 million) in October.


Climate-job losses
Hundreds of climate
researchers at Australia’s
national science agency, the
Commonwealth Scientific
and Industrial Research
Organisation (CSIRO), are
set to lose their jobs. Chief
executive Larry Marshall
said on 4 February that the
agency would be shedding up
to 350 jobs over the next two
years as it shifts its strategy
away from basic climate
science. CSIRO, which employs
more than 5,000 people, had
already dropped 1,300 jobs in
the 2 years up to June 2015,
largely because the 2014
federal budget cut the agency’s
funding by Aus$115 million
(US$108 million) over 4 years.
See go.nature.com/apbkmk for
more.


Reactor roars into life


German Chancellor Angela Merkel ignited the first
hydrogen plasma at the Wendelstein 7-X stellarator in
Greifswald on 3 February, marking the device’s official
launch. The fusion reactor’s design differs from that of the
other most-promising fusion facility, ITER, being built
near Cadarache, France. The official switch-on produced
a hydrogen plasma for just a fraction of a second, but
the helical-shaped stellarator should be able to operate
continuously, a major advantage over the pulsed
generation of ITER’s tokamak design.


Drug-trial probe
A preliminary investigation
into the death of a participant
in a first-in-human drug trial in
France, and the hospitalization
of five others, has failed to
establish a cause. In a report
to health minister Marisol
Touraine on 4 February,
the general inspectorate for
social affairs criticized trial
contractor Biotrial for giving
doses of the test drug to other
subjects on 11 January, despite
the first subject having been
hospitalized the previous
day. The trial was halted later
that day, and the report says
that Biotrial respected trial
protocols. The five patients
have since been discharged
from hospital.


Methane lawsuit 
The state of California 
announced on 2 February that 
it had filed a lawsuit against 
the Southern California 

Gas Company for failing to 
promptly report and halt 
emissions from a massive 
natural-gas leak at one of its 
storage facilities. Discovered 
on 23 October last year, the 
gas leak at the Aliso Canyon 
natural-gas storage facility 
has yet to be controlled. As 
of 8 January, cumulative 
methane emissions from the 
facility were estimated to 

be equivalent to more than 

2 million tonnes of carbon 
dioxide, or 2% of the state's 
greenhouse-gas emissions 
over the 11-week period. 


BUSINESS
CRISPR goes public 


Editas Medicine, a 

company in Cambridge, 
Massachusetts, that aims 

to create therapies from 
CRISPR-Cas9 gene-editing 
technology, went public 

on 2 February, raising 
US$94.4 million. The initial 
public offering was seen as 

a vote of confidence both in 
the sluggish biotechnology 
market and in a tool that has 
yet to reach clinical testing. 
Several pioneers in CRISPR- 
Cas9 technology are among 
the founders of the company. 


China chemicals bid 
China National Chemical 
(ChemChina) has offered 

to buy agrochemicals firm 
Syngenta in a US$43-billion 
deal that could alter the 


landscape of the industry.
Syngenta, headquartered in 
Basel, Switzerland, is best 
known for its agricultural seed 
and pesticides business, and 
was wooed by its competitor 
Monsanto in a takeover bid 
last year. But on 3 February, 
Syngenta announced that it 
had accepted an offer from 
ChemChina of Beijing. The 
deal must be approved by 
regulators before it can be 
completed. 


HHMI president 


The Howard Hughes 
Medical Institute (HHMI) 

in Chevy Chase, Maryland, 
has appointed its first female 
president. Erin O'Shea, 

a systems biologist from 
Harvard University in 
Cambridge, Massachusetts, 
will take office in September, 
the organization announced 
on 3 February. O’Shea, whose 
speciality is gene regulatory 
networks, has served as 
HHMI’s chief scientific officer
and vice-president since 
2013. She will be HHMI’s 
sixth president, replacing 
biochemist Robert Tjian of
the University of California, 
Berkeley. 


Macchiarini out
On 4 February, the
Karolinska Institute (KI)
in Stockholm announced
that it will not renew the
contract of surgeon Paolo
Macchiarini (pictured) when
it ends in November 2016.
The KI visiting professor
pioneered artificial-windpipe
transplants. A Swedish
documentary alleged that
he broke ethical guidelines,
prompting the KI to consider
reopening an investigation
into Macchiarini’s work
and to ask him to phase out
his research. The KI has
also ordered an external
investigation into the way
the case was handled. Urban
Lendahl, a geneticist at the KI,
resigned as secretary-general
of the Nobel Assembly on
6 February because he expects
to become involved in the
investigations. See
go.nature.com/rh1ne4 for more.


TREND WATCH


South Korea is the world’s most
research-intensive economy,
according to figures from the
Organisation for Economic
Co-operation and Development
(OECD). In 2014, South Korea
invested 4.29% of its gross
domestic product (GDP) in
research and development
(R&D), stretching ahead of
Israel, which invests 4.11%.
The South Korean increase is
largely due to rising industrial
investment, and the country is
aiming to spend 5% of GDP on
research by 2017.


Europe's life-sciences 
organization EMBO has 
extended its reach in Asia. 
On 4 February, India became 


EMBO’s second Associate
Member State. Researchers 
in India are now eligible 

to participate in all EMBO 
programmes, including 
courses and workshops, long- 
and short-term fellowships 
and the EMBO Young 
Investigator Programme. 
Singapore became the 
organization’s first Associate
Member State last July. 


Italian misconduct 
The ethics committee of 
Italy’s National Research 
Council, the CNR, announced 
on 9 February that it has 
identified 17 cases of scientific 
misconduct in published 
papers co-authored by 
researchers in CNR labs. The 
CNR did not publicly identify 
the papers, but says that it 

is discussing next actions 

with the relevant authors. 

The committee assumed 
responsibility for research 
integrity two years ago, and 

is now examining a further 

61 cases. It published guidelines 
for good scientific practice in 
June last year, committing it 

to investigating all allegations 
of misconduct brought to its 
attention. 


Smoking laws


Smoking bans around the
world have measurably
improved the health of
non-smokers, according to a
4 February review (K. Frazer
et al. Cochrane Database Syst.
Rev. 1, CD005992; 2016). A
group of systematic reviewers
known as the Cochrane
Collaboration assessed
research from 21 countries
and found improvements
in cardiovascular health
and reduced mortality from
smoking-related illnesses since
the first national ban, in Ireland
in 2004. Effects on respiratory
health, however, were “less
consistent”. In total, 77 studies
were assessed by the Cochrane
team, and the findings
strengthen the conclusions
made in a previous review in
2010, the researchers say.


SOUTH KOREA’S RESEARCH BOOM


South Korean investment in research and development (R&D)
is soaring as a percentage of its gross domestic product (GDP).


11 FEBRUARY

A hotly anticipated
announcement from
the LIGO collaboration
about the existence of
gravitational waves,
which were predicted
100 years ago by Albert
Einstein, will be made at
the National Press Club
in Washington DC.
go.nature.com/cqt5te


12 FEBRUARY
Astro-H, an X-ray
astronomy satellite led
by the Japanese Space
Agency, will launch
from Tanegashima
Space Center, Japan (see
page 140 for more).


14-19 FEBRUARY
Whitefly, which can
devastate crops, is
discussed at the 2nd
International Whitefly
Symposium in Arusha,
Tanzania.
www.iita.org/iws2016


16 FEBRUARY
Sentinel-3A, a European
Space Agency
Earth-observing
satellite, launches from
Plesetsk, Russia.
go.nature.com/hgadp7


NATURE.COM
For daily news updates see:
www.nature.com/news


Sexual-harassment
cases pose challenge for 
funders p.138 


Astronomers hope it’s 
third time lucky for Japanese 
probe p.140 


Company publishes
usually-secret
replication failures p.141

Industry 
prepares to abandon 
pursuit of Moore’s law p.144 


AUSTRALIA 


Blazes threaten iconic trees


Charred pencil pines mark a fire-ravaged landscape in Tasmania, Australia.


As Tasmanian climate warms, bushfires are encroaching on forest ecosystems that 
date back more than 180 million years. 


BY EMMA MARRIS 


Bushfires sparked by lightning storms
have burned large areas of northwestern
Tasmania since mid-January,
consuming more than 110,000 hectares,
including almost 2% of the region’s United
Nations World Heritage wilderness lands.
Large Australian bushfires have become
so common that they hardly seem like news,
but these are different. Rather than burning
swathes of fire-adapted vegetation such
as Eucalyptus or Banksia, many of the recent 
blazes have raged through ecosystems that are 
not adapted to fire at all — relict forests from 
when Tasmania was part of the Gondwana 
supercontinent, more than 180 million 
years ago. 

When fires burn the pencil pines (Ath- 
rotaxis cupressoides) and King Billy pines 
(Athrotaxis selaginoides) that dominate these 
high-altitude ecosystems, they kill not only 
the trees but also their seeds — and inciner- 
ate the peaty soil in which they grow. In the 


hardest-hit areas, natural regeneration of 
these ancient forests is all but impossible. 
The trees, many of which are more than 
1,000 years old, have been able to survive in 
Tasmania so far because its climate is colder 
and wetter than that of mainland Australia. But 
climate change may tip the odds against them. 
The ‘dry lightning’ strikes (from thunderstorms
in which most rain evaporates before
reaching the ground) that ignited the current
fires used to be vanishingly rare in western Tasmania,
but climate change is thought to have
made such storms more common in recent
years. Tasmania saw one of the driest years on
record in 2015. Furthermore, logging and dry
conditions in the rainforest that surrounds these
alpine forests have reduced its ability to act as a
firebreak.


FEELING THE BURN


Since mid-January,
bushfires have blazed in
northwestern Tasmania,
threatening forest
ecosystems that have not
evolved to cope with such
fires. Some of these
landscapes date back more
than 180 million years.

[Map legend: fires; World Heritage site; forest reserve; national park; state or regional reserve; conservation area; other designation. Map labels: Indian Ocean; scale bar, 25 km.]

“There was no doubt pencil pine was on the 
mainland, but the fire and climate regime meant 
it couldn’t persist,” says David Lindenmayer,
a forest ecologist at the Australian National 
University in Canberra. “If Tasmania is going 
to become more like the mainland, there is a 
distinct possibility that its time is going to be 
done. That is a huge loss for the world.” 

Just a small percentage of the alpine and 
central-plateau fire-sensitive ecosystems in 
northwestern Tasmania have been killed by the 
blazes; 73 fires were still burning on 9 February.
But if fires become more frequent, that may spell
the beginning of the end. 

“If you have 2% [burning] every 10 or 
15 years, it is not long before it is pushed into 
only the most fire-protected refuges,” says 
Jamie Kirkpatrick, a geographer and conser- 
vation ecologist at the University of Tasmania 
in Hobart. 

To the chagrin of ecologists and bushwalkers, 
the fires have threatened several iconic land- 
scapes, including the Walls of Jerusalem 
National Park and Mount Anne, the highest 
peak in southwestern Tasmania (see ‘Feeling 
the burn’). Hobart-based photographer Dan
Broun and a colleague hiked up to a burned 
alpine area last weekend. When they crested 
the plateau, they were greeted by a dead world. 
“We were in shock. What we were seeing was 


complete and utter devastation,” says Broun. 

There have already been discussions about 
reseeding the burnt areas, but experts warn 
that it may not work; it is not at all clear that the 
strategy would make sense in a ‘new normal’ 
in which fires are predicted to run rampant in 
the area. Furthermore, grazing by wallabies 
and wombats keeps many seedlings from ever 
growing tall enough to reproduce. The exist- 
ing pines may date from a slender window of 
time when heavy snows kept grazing animals 
away for long enough for the trees to grow out 
of their reach — a process that takes 50 years. 

With threats to the region’s iconic forests 
clear, University of Tasmania ecologist David 
Bowman has called for an increased effort to 
collect seeds of fire-sensitive Tasmanian species 
and conserve them in another location, perhaps 
even on a sub-Antarctic island that would be
safer from fire. 

That would require a lot of seeds. Luckily, 
2015 was a mast year — a period in which 
trees produce unusually large crops of cones 
and seeds — for many high-altitude conifers 
in Tasmania. In March and May, government 
biologists collected 1.57 million viable seeds 
from pencil and King Billy pines, as well as the 
shrub-like Cheshunt pine (Diselma archeri) 
and drooping pine (Pherosphaera hookeriana). 

“We knew these sorts of [fire] events were 
likely in the future and the species we were 
looking at were so vulnerable to fire,” says James 
Wood, head of the Tasmanian Seed Conserva- 
tion Centre at the Royal Tasmanian Botanical 
Gardens in Hobart. “Effectively, we are a Noah's 
ark. We are just trying to hold onto material so 
it is not lost permanently.”


SOURCE: MAP, GRANT WILLIAMSON/UNIV. TASMANIA; DATA, IUCN WORLD
DATABASE OF PROTECTED AREAS/TASMANIAN FIRE SERVICE


US science agencies grapple 
with sexual harassment 


Funders threaten harsh penalties, but many have yet to take action. 


BY ALEXANDRA WITZE 


Fabricating, falsifying or plagiarizing data
can get a grant yanked or a researcher
blacklisted for breaking the professional 
code of science. Now, some funders are facing a 
fresh challenge: what to do with grants given to 
scientists who commit sexual transgressions. 
The US government does not classify sexual 
infractions as research misconduct. Instead, 
as recent high-profile cases illustrate, the 
National Science Foundation (NSF), NASA 
and the National Institutes of Health (NIH) 


must navigate a relatively new legal landscape 
when confronted with sexual harassment by 
grant recipients. What is clear, specialists in 
research ethics say, is that agencies, institu- 
tions and researchers all need to improve their 
response to such behaviour. 

“The public has a right for us to conduct 
publicly funded work honourably and with 
integrity,” says C. K. Gunsalus, director of the
National Center for Professional and Research 
Ethics at the University of Illinois at Urbana- 
Champaign. 

The latest scandal broke on 2 February,
when The New York Times reported that
molecular biologist Jason Lieb had left the Uni- 
versity of Chicago in Illinois after an internal 
investigation found that he had made unwel- 
come sexual advances to female graduate stu- 
dents, and had engaged in sexual activity with 
a student who was incapacitated and could 
not consent. In January, it became public that 
Christian Ott, a theoretical astrophysicist at the 
California Institute of Technology in Pasadena, 
was on unpaid leave for violating the institution’s
sexual-harassment policy. And last October,
astronomer Geoffrey Marcy announced
that he would retire from the University of
California, Berkeley, after a similar verdict.

LAURIE HATCH

Lieb, Ott and Marcy have each brought 
millions in government research dollars to 
their universities, which must now decide 
how to handle the money. Funding agencies 
and institutions must comply with Title IX, 
a federal law that forbids sex discrimination 
(which, in legal terms, can include harassment 
or assault) in any educational programme that 
receives government money. The law has been 
on the books since 1972, but in 2011, the US 
Department of Education said that it would 
step up its enforcement of the sexual-harass- 
ment aspects. 

To comply with Title IX, institutions must 
have a representative who investigates and 
resolves allegations of sex discrimination. 
The inquiries into Lieb, Ott and Marcy all went 
through the Title IX offices at their respective 
universities. 

Once a Title IX investigation is complete, the 
institution must decide whether to take disci- 
plinary action. A funding agency can open its 
own inquiry and levy extra penalties if it deems
them necessary. NASA and the NSF have both 
put out statements recently saying that they
do not tolerate sexual harassment by grantees;
the NSF even threatened to pull funds entirely
from institutions that do not comply with Title
IX. But it has never banned a grantee, let alone
an institution, for violating Title IX.


CODES OF CONDUCT


Countries have very different laws governing
how to deal with sexual misconduct in recipients
of federal research grants. Some link it closely to
research misconduct; others address the issue
as more of a workplace violation.


CANADA

A three-agency framework for responsible conduct
of research says that “researchers shall follow the
requirements of applicable institutional policies
and professional or disciplinary standards”.


AUSTRALIA

The Australian Code for the Responsible Conduct
of Research classifies “other forms of
misconduct, such as harassment, bullying or
financial misconduct” as covered by employee
relationships with their institutions.


EUROPE
The European Code of Conduct for Research
Integrity includes, among its principles of
integrity, the “responsibility for the scientists
and researchers of the future”.


UNITED STATES

A phrase that covers sexual infractions was
removed from the definition of research
misconduct in 2000. But funders must adhere
to a law known as Title IX, which prohibits
discrimination on the basis of sex.


The Lick Observatory in California is home to an exoplanet search once run by Geoffrey Marcy, who left
the University of California, Berkeley, after a sexual-harassment investigation.

“Statements are good for changing the 
culture, but they have to be supported by 
action,” says Katie Hinde, an anthropologist at 
Arizona State University in Tempe. 

The process is more complicated than 

simply yanking existing grants. “People come 
to us and say, why don't you fix it?” says James 
Ulvestad, head of the NSF’s astronomy divi- 
sion. “Well, what the funding agencies can do 
is what federal law allows us to do.” 

In general, funding awards are made to 
institutions, not the person who is the principal
investigator (PI) for the work. Even if a PI
has been found to violate institutional policies, 
his or her grant money will continue to flow to 
support graduate students, postdoctoral fellows 
and other collaborators on an affected project. 

Lieb is PI or co-PI on more than US$1.2 mil- 
lion in NIH grants, and the University of Chi- 
cago may nominate someone to replace him. 
Ott is involved in more than $3.2 million in 
NSF funds; those grants are under review but 
their status remains unchanged for now, says 
NSF spokeswoman Ivy Kupec. Ott still has 
access to NSF facilities, including the Blue 
Waters supercomputer in Illinois, on which 
he runs simulations of supernova explosions. 

Marcy had many lucrative private grants, 
including a share of the $100-million Break- 
through Listen search for extraterrestrial intel- 
ligence. Those have been transferred to new PIs. 


His old employer is now working on designating 
new PIs for the two NASA grants totalling 
roughly $1 million that are in Marcy’s name. 

It was actually a case of sexual assault 
that prompted the modern US definition of 
research misconduct, says Nicholas Steneck, 
a specialist in research integrity at the Uni- 
versity of Michigan in Ann Arbor. In 1989, 
the NSF received complaints about the 
actions of a senior researcher while taking 
undergraduates to a research site in another 
country. The agency’s inspector-general ulti- 
mately found the researcher to have commit- 
ted “16 incidents of sexual misfeasance with 
female graduate and undergraduate students 
at the research site; on the way to the site; and 
in his home, car, and office”. Many could be 
classified as sexual assaults. 

The agency barred him from receiving federal 
funding for five years, according to a definition 
of misconduct that at that time included “seri- 
ous deviations” from accepted research prac- 
tices. After a fierce battle over the meaning of 
the phrase, agencies excised it from their definitions
of misconduct in 2000. “Sexual harassment
shouldn't be tolerated,” Steneck says. “But
it’s not so easy to say, ‘It’s research misconduct
and that’s the way it ought to be handled.’”

US agencies might learn a thing or two from 
funders in other countries, Steneck adds. Both 
Canada and Australia (see ‘Codes of conduct’) 
require federally funded scientists to meet a 
minimum ethical standard that specifically 
describes institutional roles. 

“You need to set out a clear code of what to 
expect,” he says.


SPECTRA IN SPACE


The Japanese space observatory 
ASTRO-H will detect an 
unusually broad range of 
wavelengths, from low-energy, 
‘soft’ X-rays to high-energy, 
‘hard’ X-rays and soft γ-rays.
The Soft X-ray Spectrometer 
(SXS) will be able to produce 
high-resolution spectra for wide 
objects such as galaxy clusters. 


Soft X-ray Imager 
SXS

Soft γ-ray Detector
Hard X-ray Imager 


Soft X-ray Telescopes 
Hard X-ray Telescope 


Extendable boom 
(6.4 metres) 


LENGTH Approx. 14 m | MASS 2.7 tonnes | POWER 3,500 watts | LIFE 3 years 


ASTRONOMY 


High stakes for 
Japan’s space probe 


ASTRO-H will carry a technology that two earlier, ill-fated 
space telescopes failed to put into action. 


BY DAVIDE CASTELVECCHI 


Astronomers around the world are
hoping that the third time will be the
charm as Japan prepares to launch its
largest space observatory ever. The telescope 
will use X-rays to study phenomena from black 
holes to dark matter and carries cryogenic 
imaging technology that flew on two previous 
missions — but met disaster both times. 
Weather permitting, a Japanese-built H-IIA 
rocket is scheduled to launch the probe, provi- 
sionally named ASTRO-H, from Tanegashima 
Space Center at 17:45 local time on 12 Febru- 
ary. Once in orbit, the 2.7-tonne probe will 
stretch out to its maximum length of 14 metres, 
including a 6.4-metre boom that will host an 
imager capable of collecting high-energy, or 
‘hard’ X-ray photons (see ‘Spectra in space’). 
The Japan Aerospace Exploration Agency 
(JAXA) leads the mission with an investment 
of ¥31 billion (US$265 million), but there is 


also major participation from NASA, as well 
as institutions in six other nations and the 
European Space Agency (ESA). 

Studying X-ray emissions is the best way 
to observe a wide range of cosmic phenom- 
ena, from galaxy clusters to the super-heated 
accretion disks around black holes. But Earth's 
atmosphere is mostly opaque to radiation out- 
side the visible spectrum, and particularly to 
X-rays and γ-rays, meaning that most X-ray
astronomy requires a satellite. 

The major existing X-ray satellites are 
NASA’s Chandra X-ray Observatory and ESA’s
XMM-Newton, which both launched in 1999. 
These can analyse the constituent wavelengths 
of X-rays — the spectra — emitted by point- 
like objects such as stars. But ASTRO-H will 
be the first to provide high-resolution spectra 
for much more spread-out X-ray sources such 
as galaxy clusters, says Norbert Schartel, project 
scientist for XMM-Newton at ESA’s European
Space Astronomy Centre outside Madrid, who 


is also a member of ASTRO-H’s ESA team. 

Spectra carry information about the veloc- 
ity and turbulence of the plasma that per- 
vades galaxy clusters, which in turn can reveal 
whether, and how, a cluster resulted from a 
merger of two smaller ones, says Christine 
Forman, a high-energy astrophysicist at Har- 
vard University in Cambridge, Massachusetts. 
X-ray pictures of clusters, when combined with 
visible-light images, have also previously pro- 
vided striking — although indirect — evidence 
for the existence of dark matter. ASTRO-H 
should be able to help settle whether a 3.5-kilo- 
electronvolt X-ray signal seen in certain galax- 
ies is a signature of dark matter decaying into 
photons — or something else, says Alexey 
Boyarsky, an astrophysicist at Leiden Univer- 
sity in the Netherlands who co-discovered the 
signal (see Nature 517, 422-423; 2015). 

ASTRO-H will also cover a broader range of 
wavelengths than most other missions, from 
‘soft’, low-energy photons starting at 300 eV,
through hard X-rays and right up to soft
γ-rays of 600 keV. But only the soft X-rays will
have their spectra imaged to high resolution. 
At the heart of the instrument responsible for 
this — the Soft X-ray Spectrometer (SXS) — is 
an array of 36-pixel elements that must be kept 
at 0.05 degrees above absolute zero, well inside 
the main body of the craft. When a photon 
strikes one of the sensors, the temperature of 
the sensor rises slightly, causing its electrical 
conductivity to increase. The resulting change 
in voltage can be used to measure the energy 
of the original photon — and therefore its 
wavelength — with a precision of nearly one 
part in 1,000.
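
As a rough illustration of the relation between photon energy and wavelength that this measurement relies on, the sketch below applies E = hc/λ to the energy limits quoted above. The function names, the rounded physical constant and the use of a resolving power of exactly 1,000 are assumptions made for illustration; they are not details taken from the mission.

```python
# Illustrative sketch only: converts photon energy to wavelength via E = h*c / wavelength.
# Names and the fixed resolving power are assumptions, not ASTRO-H specifications.

HC_EV_NM = 1239.841984  # Planck constant times speed of light, in eV nm (rounded)


def wavelength_nm(energy_ev: float) -> float:
    """Return the photon wavelength in nanometres for an energy in electronvolts."""
    if energy_ev <= 0:
        raise ValueError("photon energy must be positive")
    return HC_EV_NM / energy_ev


def energy_resolution_ev(energy_ev: float, resolving_power: float = 1000.0) -> float:
    """Return the energy step that corresponds to one part in `resolving_power`."""
    if resolving_power <= 0:
        raise ValueError("resolving power must be positive")
    return energy_ev / resolving_power


if __name__ == "__main__":
    # Energy limits quoted in the article: 300 eV (soft X-rays) to 600 keV (soft gamma-rays).
    for label, energy in [("soft X-ray limit", 300.0), ("soft gamma-ray limit", 600e3)]:
        print(f"{label}: {energy:g} eV -> {wavelength_nm(energy):.4g} nm")
    # Energy step at the 3.5-keV signal, assuming one part in 1,000.
    print(f"step at 3.5 keV: {energy_resolution_ev(3500.0):.1f} eV")
```

Under these assumptions, a precision of one part in 1,000 at 3.5 keV corresponds to an energy step of a few electronvolts.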

The technology first took shape in 1984, 
when Richard Kelley, an astrophysicist at 
NASA’s Goddard Space Flight Center in Greenbelt,
Maryland, started designing sensors for
Chandra. After NASA scaled back its ambitions 
for the observatory and scrapped that plan, 
Kelley and his team provided the spectrometer 
for a Japanese X-ray telescope called ASTRO-E. 
But in 2000, the rocket that was supposed to 
put it into orbit crashed shortly after take-off. 

JAXA prepared a replacement mission, 
Suzaku, which reached orbit in July 2005 — 
only for disaster to strike again. The liquid 
helium used to keep a spectrometer’s sensors 
cold has to be slowly released, or outgassed. But 
in Suzaku, a tiny amount of that helium accu- 
mulated inside the craft. This was enough to 
ruin the vacuum that was supposed to insulate 
the helium tank from the rest of the craft. The 
tank heated up faster than expected, causing 
the helium to boil off and vent into space within
four weeks of launch. Unable to stay super- 
chilled, the spectrometer was crippled before 
it could start making observations, although 
Suzaku’s other instruments continued to oper- 
ate until the craft was decommissioned in 2015. 

For ASTRO-H, JAXA redesigned the tank 
with plumbing to outgas the helium straight 
into space. And to further guard against glitches,
Tadayuki Takahashi, an astrophysicist at the
University of Tokyo and ASTRO-H’s project
manager, pushed his collaboration to work
without borders. “Usually, international
coalitions have clearly defined interfaces,”
with different laboratories providing isolated
components of a spacecraft and its payload,
says Takahashi. But ASTRO-H researchers
regularly visited each other’s labs, sometimes
for months at a time.


SOURCE: JAXA


JAXA

Kelley says that Takahashi forged a very 
open collaboration. “Tad understands that 
if you want to maximize the chances of suc- 
cess, you have to have no barriers,” he says. 
“Everybody has access to everything.”

Astronomers around the world will be 
allowed to request observing time with 
ASTRO-H. Each team will have exclusive 
access to the resulting data for one year, 
after which JAXA will make them pub- 
licly available — a model long adopted by 
NASA. ASTRO-H will be renamed after 
launch, although what it will be called is 
yet to be determined. 

A larger, higher-resolution version of the 
SXS is due to fly aboard Athena, an ESA-led 
X-ray astronomy mission planned for the 
late 2020s.


Additional reporting by David Cyranoski. 


The ASTRO-H 
probe is preparing 
for launch. 


REPLICATION 


IN FOCUS | NEWS 


Biotech giant posts 
negative results 


Amgen papers seed channel for discussing reproducibility. 


BY MONYA BAKER 


A biotechnology firm is releasing data on
three failed efforts to confirm findings
published in high-profile scientific
journals — details that the industry usually 
keeps secret. 

Amgen, headquartered in Thousand Oaks, 
California, says that it hopes the move will 
encourage others to describe their own rep- 
lication attempts, and thus help the scientific 
community to get to the bottom of work that 
other labs are having trouble verifying. 

The data are posted online on a newly 
launched channel dedicated to quickly 
publishing researchers’ efforts to confirm sci- 
entific findings (see go.nature.com/3zzea9), 
hosted by F1000Research, the publishing plat- 
form of London-based publishers Faculty of 
1000 (F1000). 

The idea emerged from discussions at a 
meeting in 2015 focused on improving scien- 
tific integrity. Sasha Kamb, who leads research 
discovery at Amgen, said that his company’s 
scientists have many times failed to reproduce 
academic studies but that it takes too much 
effort to publish these accounts. 

Bruce Alberts, a former editor-in-chief of 
Science who sits on F1000Research’s advisory 
board, suggested that Kamb try the faster 
F1000 route — an open-science publishing 
model in which submitted studies are posted 
online (for a fee that ranges from US$150 to 
$1,000) before undergoing open peer review; 
submissions are subject to checks by F1000 
editors to ensure that data are freely available 
and that methods are adequately described. 
“The idea is to get the data out and get it 
critically looked at,” Alberts says.


IRREPRODUCIBLE HISTORY 
In 2012, Amgen researchers declared that they 
had been unable to reproduce the findings in 
47 of 53 ‘landmark’ cancer papers. Those
papers were never identified — partly because 
of confidentiality concerns — and there are no 
plans to release details now, says Kamb, who 
was not involved with that work. 

The three latest studies that Amgen has 
posted deliberately do 
not make a detailed com- 
parison of their results 
to previous papers, says 
Kamb. “We don't want to 


> NATURE.COM 
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make strong conclusions that someone else's 
work is wrong with a capital W,” he says. 

One study adds to existing criticism of a 
Science paper that suggested a cancer drug
might be a potential treatment for Alzheimer’s
disease; a second counters earlier findings 
(including some by Amgen researchers) 
connecting a gene to insulin sensitivity in 
mice; and a third counters a Nature paper
reporting that inhibiting one particular pro- 
tein could enhance the degradation of others 
associated with neurodegenerative diseases. 
Amgen researchers did not contact the origi- 
nal authors when they conducted their studies, 
Kamb says. 


GAINING TRACTION 

The main way that the scientific community 
spreads the word about irreproducible research 
is through insinuation, which is inefficient and 
unfair to the original researchers, says Ricardo 
Dolmetsch, global head of neuroscience at 
Novartis’s Institutes for Biomedical Research 
in Cambridge, Massachusetts. “Anything we 
can do to improve the ratio of signal to noise in 
the literature is very welcome,” he says.

The F1000 initiative is useful, but previous 
efforts have tried and failed to encourage the 
reporting of replications and negative results, 
cautions John Ioannidis, who studies scientific 
robustness at California's Stanford University. In 
general, the scientific community undervalues 
such work, he says. 

But Kamb says that he has spoken with 
several industry leaders who have expressed 
support, and he hopes that they will contribute 
eventually. Morgan Sheng, a vice-president at 
biotechnology company Genentech in South 
San Francisco, says that he can foresee his
company’s scientists submitting data to the venture.
“I believe the main risk of a publication venue
like the F1000 channel is that it becomes a
place for ‘bashing’ good science, because
biological experiments are complex and beset by
many variables that are hard to control.
Non-replication does not necessarily mean ‘not
true’,” Sheng adds.
US panel backs
‘three-person’ 
embryos 


But regulators cannot 
approve clinical trials. 


BY SARA REARDON 


The US Food and Drug Administration
(FDA) should approve clinical trials
of a gene-therapy technique to create
embryos with genetic material from three
people, provided that certain conditions are met, the US
National Academies of Sciences, Engineering
and Medicine said on 3 February.

The controversial technique involves replac- 
ing the faulty energy-producing mitochondria 
in a woman's egg with mitochondria from the 
egg of a healthy donor. The aim is to prevent 
the transmission of disabling or fatal diseases 
caused by mutations in a mother’s mitochon- 
drial DNA. But concerns about the technique’s 
safety, and the psychological and social impli- 
cations of children with three genetic parents, 
have given US regulators pause. Moreover, a 
federal law approved late last year prevents the 
FDA from allowing any such trials in humans. 

The academy panel suggests limiting the 
tests of mitochondrial replacement to male 
embryos as a safety precaution. Male offspring
would not be able to pass their modified mito- 
chondria to future generations, because a child 
inherits its mitochondria from its mother. 

The report also outlines several extra steps 
to monitor the safety of mitochondrial replace- 
ment, such as attempting to track the children 
born as a result of the technique and sharing 
the resulting data with scientists and the pub- 
lic. If mitochondrial replacement is proved 
safe in male offspring, it could be expanded to 
female embryos, the advisory panel said. 

By contrast, the United Kingdom last year 
approved mitochondrial replacement with no 
restrictions on the sex of a modified embryo. 

Shoukhrat Mitalipov, a reproductive- 
biology specialist at the Oregon Health and 
Science University in Portland, sees the US 
advisory panel report as a hollow victory. In 
2014, he applied to the FDA to perform a clini- 
cal trial of mitochondrial replacement therapy 
with human embryos — a trial that cannot take 
place under current US law. 

Others disagree with the panel’s conclusions. 
The new report “leapt to the conclusion that 
things should go forward despite all the
concerns”, says Marcy Darnovsky, executive
director of the non-profit Center for Genetics and
Society in Berkeley, California.


Pregnant women infected with the Zika virus are being monitored for effects on their unborn babies. 


PUBLIC HEALTH 


Scientists probe Zika 
link to birth defects 


Mosquito-borne virus is now present in 33 countries. 


BY ERIKA CHECK HAYDEN 


Public-health authorities are investigating
whether the Zika virus has caused an
apparent surge in the number of infants
born with microcephaly, or
abnormally small heads, in at least seven
countries. But conclusively determining
whether the mosquito-borne virus is to blame
could take months to years, researchers say.

Concerns first arose in Brazil, which in 
November declared a national public-health 
emergency. As of 2 February, officials there 
had investigated 1,113 of 4,783 suspected 
cases of microcephaly reported since late last 
year, and confirmed that 404 of those could 
be linked to Zika. On 1 February, a commit- 
tee convened by the World Health Organiza- 
tion said that a causal link between Zika and 
microcephaly is “strongly suspected, though 
not yet scientifically proven”.

That is not for a lack of effort. Work now 
under way includes case-control studies to 
compare rates of Zika infection in babies 
who are born with microcephaly and in those 
without it, as well as genetic sequencing of
the virus and efforts to develop a molecular
diagnostic test for Zika infection.

Making progress has been difficult 
because scientists know relatively little about 
Zika; there is no easy-to-use test to diagnose 
infections; and physicians disagree about 
how to define microcephaly, says Bruno 
Andrade, an immunologist at the Fiocruz 
research institute in Bahia, Brazil. “All of this 
started less than two months ago — that’s 
when everything stepped up,” he says. “We
are in the middle of this nightmare here.” 

So far, two lines of evidence support a link 
between the virus and microcephaly. Micro- 
cephaly cases in Brazil started to rise around 
6 months after authorities confirmed Zika 
transmission there, hinting that the defect 
might have been caused by in utero expo- 
sure to the virus. And researchers in Brazil 
have found traces of the virus, or antibodies 
to it, in the amniotic fluid, brains or spinal 
fluid of 15 fetuses and babies diagnosed with 
microcephaly. 

This is suggestive, but not conclusive. “Most 
of us believe it’s highly plausible that Zika is 
the cause of this epidemic of microcephaly, 
but we need additional evidence,” says Albert
Ko, an infectious-disease physician and epide- 
miologist at the Yale School of Public Health in 
New Haven, Connecticut. 

In the hope of producing more-definitive 
data, the Brazilian ministry of health is now 
setting up large studies. In one, researchers 
will follow up 6,000 pregnant women in 
northeastern Brazil to investigate the effects 
of Zika and of microcephaly. 


KNOWLEDGE GAPS 
Epidemiological studies are often complex 
because Zika causes a relatively mild illness 
in adults and there is no widely deployed test 
for the virus. This means that most of the 
mothers who have participated in previous 
studies were never diagnosed with Zika, even 
if they had it. To address this problem, the 
Brazilian health ministry is now asking moth- 
ers whether they had Zika symptoms instead 
of whether they were diagnosed with Zika. 
But many researchers say that epidemio- 
logical data alone will not convince them of 
a link between Zika and microcephaly; they 
would like to see evidence of how and why the 
virus causes the condition. With this in mind, 
scientists are developing animal models to 
investigate Zika’s effects on the body, such as 
which tissues it infects, and why fetal brains 
might be especially vulnerable. 


“There’s a lot of basic work and research that 
we need to do,” says Anthony Fauci, director of
the US National Institute of Allergy and Infec- 
tious Diseases in Bethesda, Maryland. 

It’s plausible, for instance, that Zika crosses 
from mother to baby through the placenta, 
as do some related viruses, such as West Nile 
virus. But these other viruses don’t often cause 
infant brain damage, so it’s not clear why (or
how) Zika might, says David Morens, a senior
adviser to Fauci.

The virus may be toxic only while a
fetus’s brain is still developing its major struc- 
tures, in the first two months of pregnancy. 
Or it may persist in the body for a long period, 
which would explain why Zika is seen in still- 
born babies with microcephaly. “If the insult 
happened early on, then why is the virus pre- 
sent at seven months when the miscarriage 
occurs?” Morens says. “There must be a com- 
bination of things going on.” 

Another conundrum is what it is that 
makes these women and babies so vulnerable: 
the vast majority of women infected with Zika 
go on to have healthy babies. 

But whatever scientists ultimately dis- 
cover about Zika, another major chal- 
lenge remains: supporting the children 
born with microcephaly. Many are living in
what Ko calls a “vegetative state”, and may
go on to have seizures. That could be diffi- 
cult for their families — many of which are 
impoverished — to bear. 

“We have to start settling down and think 
about how we're going to take care of these 
kids,” Ko says. If there is a link between Zika and
microcephaly, the number of babies affected by 
the condition could soar as the virus spreads, 
he cautions. “We have no idea how big it’s going 
to get.”


CORRECTIONS 

The two elements in the key for the map
in the News story ‘Microcephaly surge
in doubt’ (Nature 530, 13-14; 2016)
were accidentally swapped round. The
corrected graphic can be seen at
go.nature.com/ekmksk. The dollar conversion of
€4.6 billion in the News story ‘Science hubs 
win in major revamp’ (Nature 530, 18-19; 
2016) should have been US$5 billion not 
$5 million. The News story ‘Academics 
across Europe join ‘Brexit’ debate’ (Nature 
530, 15; 2016) erroneously stated that the 
latest UK contribution to the EU budget 
was €140 billion. In fact, it contributed 
around €17 billion. 
ILLUSTRATION BY REBECCA MOCK


THE SEMICONDUCTOR INDUSTRY 
WILL SOON ABANDON ITS PURSUIT 
OF MOORE'S LAW. 

NOW THINGS COULD GET A LOT
MORE INTERESTING. 


Next month, the worldwide semiconductor industry will formally
acknowledge what has become increasingly obvious to every- 
one involved: Moore's law, the principle that has powered the 
information-technology revolution since the 1960s, is nearing 
its end. 

A rule of thumb that has come to dominate computing, Moore’s 
law states that the number of transistors on a microprocessor chip 
will double every two years or so — which has generally meant that 
the chip’s performance will, too. The exponential improvement that 
the law describes transformed the first crude home computers of the 
1970s into the sophisticated machines of the 1980s and 1990s, and from 
there gave rise to high-speed Internet, smartphones and the wired-up 
cars, refrigerators and thermostats that are becoming prevalent today. 
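
As a rough illustration of what a two-year doubling implies, the short Python sketch below projects transistor counts under an idealized version of the rule. The function name, the starting count and the starting year are hypothetical placeholders chosen for illustration; they are not figures from the industry road map.

```python
def projected_transistors(start_count, start_year, year, doubling_years=2.0):
    """Idealized Moore's-law projection: the count doubles every `doubling_years`."""
    elapsed = year - start_year
    return start_count * 2 ** (elapsed / doubling_years)


# Hypothetical starting point (illustrative only): 1,000 transistors in year 0.
growth_after_20_years = projected_transistors(1_000, 0, 20) / 1_000
print(growth_after_20_years)  # 1024.0, that is, ten doublings in 20 years
```

Two decades of doubling every two years gives roughly a thousandfold increase, which is why even a modest slowdown in the doubling time matters so much over the long run.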

None of this was inevitable: chipmakers deliberately chose to stay on 
the Moore's law track. At every stage, software developers came up with 
applications that strained the capabilities of existing chips; consumers 
asked more of their devices; and manufacturers rushed to meet that 
demand with next-generation chips. Since the 1990s, in fact, the semi- 
conductor industry has released a research road map every two years to 
coordinate what its hundreds of manufacturers and suppliers are doing 
to stay in step with the law — a strategy sometimes called More Moore. 
It has been largely thanks to this road map that computers have followed
the law’s exponential demands. 

Not for much longer. The doubling has already started to falter, 
thanks to the heat that is unavoidably generated when more and more 
silicon circuitry is jammed into the same small area. And some even 
more fundamental limits loom less than a decade away. Top-of-the- 
line microprocessors currently have circuit features that are around 
14 nanometres across, smaller than most viruses. But by the early 2020s, 
says Paolo Gargini, chair of the road-mapping organization, “even with 
super-aggressive efforts, we'll get to the 2-3-nanometre limit, where 
features are just 10 atoms across. Is that a device at all?” Probably not 
— if only because at that scale, electron behaviour will be governed by 
quantum uncertainties that will make transistors hopelessly unreliable. 
And despite vigorous research efforts, there is no obvious successor to 
today’s silicon technology. 

The industry road map released next month will for the first 
time lay out a research and development plan that is not centred on 
Moore’s law. Instead, it will follow what might be called the More 
than Moore strategy: rather than making the chips better and letting 
the applications follow, it will start with applications — from smart- 
phones and supercomputers to data centres in the cloud — and work 
downwards to see what chips are needed to support them. Among 
those chips will be new generations of sensors, power-management 
circuits and other silicon devices required by a world in which com- 
puting is increasingly mobile. 

The changing landscape, in turn, could splinter the industry’s long 
tradition of unity in pursuit of Moore's law. “Everybody is struggling 
with what the road map actually means,” says Daniel Reed, a computer
scientist and vice-president for research at the University of Iowa in Iowa
City. The Semiconductor Industry Association (SIA) in Washington DC,
which represents all the major US firms, has already said that it will cease 
its participation in the road-mapping effort once the report is out, and 
will instead pursue its own research and development agenda. 

Everyone agrees that the twilight of Moore’s law will not mean the 
end of progress. “Think about what happened to airplanes,” says Reed. 
“A Boeing 787 doesn't go any faster than a 707 did in the 1950s — but 
they are very different airplanes”, with innovations ranging from fully
electronic controls to a carbon-fibre fuselage. That’s what will happen 
with computers, he says: “Innovation will absolutely continue — but it 
will be more nuanced and complicated.”


LAYING DOWN THE LAW 

The 1965 essay that would make Gordon Moore famous started with
a meditation on what could be done with the still-new technology of 
integrated circuits. Moore, who was then research director of Fairchild 
Semiconductor in San Jose, California, predicted wonders such as 
home computers, digital wristwatches, automatic cars and “personal 
portable communications equipment” — mobile phones. But the heart 
of the essay was Moore’s attempt to provide a timeline for this future. 
As a measure of a microprocessor’s computational power, he looked at
transistors, the on-off switches that make computing digital. On the 
basis of achievements by his company and others in the previous few 
years, he estimated that the number of transistors and other electronic 
components per chip was doubling every year. 

Moore, who would later co-found Intel in Santa Clara, California, 
underestimated the doubling time; in 1975, he revised it to a more realistic
two years. But his vision was spot on. The future that he predicted
started to arrive in the 1970s and 1980s, with the advent of micropro- 
cessor-equipped consumer products such as the Hewlett Packard hand 
calculators, the Apple II computer and the IBM PC. Demand for such 
products was soon exploding, and manufacturers were engaging in a 
brisk competition to offer more and more capable chips in smaller and 
smaller packages (see ‘Moore’s lore’). 

This was expensive. Improving a microprocessor’s performance 
meant scaling down the elements of its circuit so that more of them 
could be packed together on the chip, and electrons could move between 
them more quickly. Scaling, in turn, required major refinements in 
photolithography, the basic technology for etching those microscopic 
elements onto a silicon surface. But the boom times were such that this 
hardly mattered: a self-reinforcing cycle set in. Chips were so versa- 
tile that manufacturers could make only a few types — processors and 
memory, mostly — and sell them in huge quantities. That gave them 
enough cash to cover the cost of upgrading their fabrication facilities, 
or ‘fabs’, and still drop the prices, thereby fuelling demand even further.

Soon, however, it became clear that this market-driven cycle could not 
sustain the relentless cadence of Moore’s law by itself. The chip-making 
process was getting too complex, often involving hundreds of stages, 
which meant that taking the next step down in scale required a net- 
work of materials-suppliers and apparatus-makers to deliver the right 
upgrades at the right time. “If you need 40 kinds of equipment and only 
39 are ready, then everything stops,” says Kenneth Flamm, an economist
who studies the computer industry at the University of Texas at Austin. 

To provide that coordination, the industry devised its first road map. 
The idea, says Gargini, was “that everyone would have a rough estimate 
of where they were going, and they could raise an alarm if they saw 
roadblocks ahead”. The US semiconductor industry launched the map- 
ping effort in 1991, with hundreds of engineers from various companies 
working on the first report and its subsequent iterations, and Gargini, 
then the director of technology strategy at Intel, as its chair. In 1998, 
the effort became the International Technology 
Roadmap for Semiconductors, with participa- 
tion from industry associations in Europe, Japan, 
Taiwan and South Korea. (This year’s report, in 
keeping with its new approach, will be called the 
International Roadmap for Devices and Systems.) 
MOORE’S LORE


For the past five decades, the number of transistors per microprocessor 
chip — a rough measure of processing power — has doubled about every 
two years, in step with Moore’s law (top). Chips also increased their ‘clock 
speed’, or rate of executing instructions, until 2004, when speeds were 
capped to limit heat. As computers increase in power and shrink in size, a 
new class of machines has emerged roughly every ten years (bottom). 


[Figure: top, transistors per chip and clock speeds (MHz) plotted against year; bottom, size of each new class of machine plotted against year.]


“The road map was an incredibly interesting experiment,” says 
Flamm. “So far as I know, there is no example of anything like this in 
any other industry, where every manufacturer and supplier gets together 
and figures out what they are going to do.” In effect, it converted Moore's 
law from an empirical observation into a self-fulfilling prophecy: new 
chips followed the law because the industry made sure that they did. 

And it all worked beautifully, says Flamm — right up until it didn't. 


HEAT DEATH 

The first stumbling block was not unexpected. Gargini and others had 
warned about it as far back as 1989. But it hit hard nonetheless: things 
got too small. 

“It used to be that whenever we would scale to smaller feature size,
good things happened automatically,” says Bill Bottoms, president of 
Third Millennium Test Solutions, an equipment manufacturer in Santa 
Clara. “The chips would go faster and consume less power.” 
But in the early 2000s, when the features began to shrink below about
90 nanometres, that automatic benefit began to fail. As electrons had 
to move faster and faster through silicon circuits that were smaller and 
smaller, the chips began to get too hot. 

That was a fundamental problem. Heat is hard to get rid of, and no 
one wants to buy a mobile phone that burns their hand. So manufac- 
turers seized on the only solutions they had, says Gargini. First, they 
stopped trying to increase ‘clock rates’ — how fast microprocessors 
execute instructions. This effectively put a speed limit on the chip’s 
electrons and limited their ability to generate heat. The maximum clock 
rate hasn't budged since 2004. 

Second, to keep the chips moving along the Moore's law performance 
curve despite the speed limit, they redesigned the internal circuitry so 
that each chip contained not one processor, or ‘core’, but two, four or
more. (Four and eight are common in today’s desktop computers and 
smartphones.) In principle, says Gargini, “you can have the same output 
with four cores going at 250 megahertz as one going at 1 gigahertz”. In 
practice, exploiting eight processors means that a problem has to be 
broken down into eight pieces — which for many algorithms is dif- 
ficult to impossible. “The piece that can’t be parallelized will limit your 
improvement,” says Gargini.
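
Gargini's point is commonly formalized as Amdahl's law, a name the article itself does not use. The Python sketch below is a minimal illustration under that assumption: the function name and the sample fractions are hypothetical, and real chips face further overheads (memory traffic, synchronization) that this simple model ignores.

```python
def amdahl_speedup(parallel_fraction, cores):
    """Speed-up over one core when only `parallel_fraction` of the work can be split."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


# Perfectly parallel work: four cores at 250 MHz match one core at 1 GHz.
print(amdahl_speedup(1.0, 4) * 250)  # 1000.0 (effective MHz)

# If half of the work cannot be parallelized, eight cores give less than a 2x gain.
print(amdahl_speedup(0.5, 8))  # about 1.78
```

Under this model, the serial share of a program sets a ceiling on the benefit of extra cores: with half the work serial, no number of cores can deliver more than a twofold speed-up.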

Even so, when combined with creative redesigns to compensate for 
electron leakage and other effects, these two solutions have enabled 
chip manufacturers to continue shrinking their circuits and keeping 
their transistor counts on track with Moore’s law. The question now is 
what will happen in the early 2020s, when continued scaling is no longer 
possible with silicon because quantum effects have come into play. What 
comes next? “We're still struggling,” says An Chen, an electrical engineer 
who works for the international chipmaker GlobalFoundries in Santa 
Clara, California, and who chairs a committee of the new road map that 
is looking into the question. 

That is not for a lack of ideas. One possibility is to embrace a 
completely new paradigm — something like quantum computing, 
which promises exponential speed-up for certain calculations, or 
neuromorphic computing, which aims to model processing elements 
on neurons in the brain. But none of these alternative paradigms has 
made it very far out of the laboratory. And many researchers think that 
quantum computing will offer advantages only for niche applications, 
rather than for the everyday tasks at which digital computing excels. 
“What does it mean to quantum-balance a chequebook?” wonders 
John Shalf, head of computer-science research at the Lawrence Berkeley 
National Laboratory in Berkeley, California. 


MATERIAL DIFFERENCES 
A different approach, which does stay in the digital realm, is the quest 
to find a ‘millivolt switch’: a material that could be used for devices at 
least as fast as their silicon counterparts, but that would generate much 
less heat. There are many candidates, ranging from 2D graphene-like 
compounds to spintronic materials that would compute by flipping 
electron spins rather than by moving electrons. “There is an enormous 
research space to be explored once you step outside the confines of the 
established technology,” says Thomas Theis, a physicist who directs the
nanoelectronics initiative at the Semiconductor Research Corporation 
(SRC), a research-funding consortium in Durham, North Carolina. 
Unfortunately, no millivolt switch has made it out of the laboratory 
either. That leaves the architectural approach: stick with silicon, but 
configure it in entirely new ways. One popular option is to go 3D. 
Instead of etching flat circuits onto the surface of a silicon wafer, build 
skyscrapers: stack many thin layers of silicon with microcircuitry 
etched into each. In principle, this should make it possible to pack 
more computational power into the same space. In practice, however, 
this currently works only with memory chips, which do not have a
heat problem: they use circuits that consume power only when a 
memory cell is accessed, which is not that often. One example is the 
Hybrid Memory Cube design, a stack of as many as eight memory 
layers that is being pursued by an industry consortium originally 
launched by Samsung and memory-maker Micron Technology in
Boise, Idaho. 

Microprocessors are more challenging: stacking layer after layer of hot 
things simply makes them hotter. But one way to get around that prob- 
lem is to do away with separate memory and microprocessing chips, 
as well as the prodigious amount of heat — at least 50% of the total — 
that is now generated in shuttling data back and forth between the two. 
Instead, integrate them in the same nanoscale high-rise. 

This is tricky, not least because current-generation microprocessors 
and memory chips are so different that they cannot be made on the same 
fab line; stacking them requires a complete redesign of the chip’s structure. 
But several research groups are hoping to pull it off. Electrical engineer 
Subhasish Mitra and his colleagues at Stanford University in California 
have developed a hybrid architecture that stacks memory units together 
with transistors made from carbon nanotubes, which also carry current 
from layer to layer. The group thinks that its architecture could reduce
energy use to less than one-thousandth that of standard chips. 


GOING MOBILE 
The second stumbling block for Moore's law was more of a surprise, but
unfolded at roughly the same time as the first: computing went mobile. 

Twenty-five years ago, computing was defined by the needs of 
desktop and laptop machines; supercomputers and data centres 
used essentially the same microprocessors, just packed together in 
much greater numbers. Not any more. Today, computing is increas- 
ingly defined by what high-end smartphones and tablets do — not 
to mention by smart watches and other wearables, as well as by the 
exploding number of smart devices in everything from bridges to the 
human body. And these mobile devices have priorities very different 
from those of their more sedentary cousins. 

Keeping abreast of Moore’s law is fairly far down on the list — if 
only because mobile applications and data have largely migrated to the 
worldwide network of server farms known as the cloud. Those server 
farms now dominate the market for powerful, cutting-edge micropro- 
cessors that do follow Moore’s law. “What Google and Amazon decide 
to buy has a huge influence on what Intel decides to do,” says Reed.

Much more crucial for mobiles is the ability to survive for long periods 
on battery power while interacting with their surroundings and users. 
The chips in a typical smartphone must send and receive signals for voice
calls, Wi-Fi, Bluetooth and the Global Positioning System, while also sens- 
ing touch, proximity, acceleration, magnetic fields — even fingerprints. 
On top of that, the device must host special-purpose circuits for power 
management, to keep all those functions from draining the battery. 

The problem for chipmakers is that this specialization is undermining 
the self-reinforcing economic cycle that once kept Moore's law hum- 
ming. “The old market was that you would make a few different things, 
but sell a whole lot of them,” says Reed. “The new market is that you have 
to make a lot of things, but sell a few hundred thousand apiece — so it 
had better be really cheap to design and fab them.”

Both are ongoing challenges. Getting separately manufactured 
technologies to work together harmoniously in a single device is often a 
nightmare, says Bottoms, who heads the new road map’s committee on the 
subject. “Different components, different materials, electronics, photonics 
and so on, all in the same package — these are issues that will have to be 
solved by new architectures, new simulations, new switches and more.” 

For many of the special-purpose circuits, design is still something of 
a cottage industry — which means slow and costly. At the University of 
California, Berkeley, electrical engineer Alberto Sangiovanni-Vincentelli 
and his colleagues are trying to change that: instead of starting from 
scratch each time, they think that people should create new devices by 
combining large chunks of existing circuitry that have known functionality.
“It’s like using Lego blocks,” says Sangiovanni-Vincentelli. It’s a
challenge to make sure that the blocks work together, but “if you were to 
use older methods of design, costs would be prohibitive”. 

Costs, not surprisingly, are very much on the chipmakers’ minds these 
days. “The end of Moore's law is not a technical issue, it is an economic
issue,” says Bottoms. Some companies, notably Intel, are still trying to
shrink components before they hit the wall imposed by quantum effects, 
he says. But “the more we shrink, the more it costs”. 

Every time the scale is halved, manufacturers need a whole new 
generation of ever more precise photolithography machines. Building a 
new fab line today requires an investment typically measured in many 
billions of dollars — something only a handful of companies can afford. 
And the fragmentation of the market triggered by mobile devices is mak- 
ing it harder to recoup that money. “As soon as the cost per transistor at 
the next node exceeds the existing cost,” says Bottoms, “the scaling stops.” 

Many observers think that the industry is perilously close to that 
point already. “My bet is that we run out of money before we run out 
of physics,” says Reed. 

Certainly it is true that rising costs over the past decade have forced a
massive consolidation in the chip-making industry. Most of the world’s
production lines now belong to a comparative handful of multinationals
such as Intel, Samsung and the Taiwan Semiconductor Manufacturing
Company in Hsinchu. These manufacturing giants have tight relationships
with the companies that supply them with materials and fabrication
equipment; they are already coordinating, and no longer find the
road-map process all that useful. “The chip manufacturer’s buy-in is
definitely less than before,” says Chen.

Take the SRC, which functions as the US industry’s research agency: it
was a long-time supporter of the road map, says SRC vice-president
Steven Hillenius. “But about three years ago, the SRC contributions went
away because the member companies didn't see the value in it.” The SRC,
along with the SIA, wants to push a more long-term, basic research agenda
and secure federal funding for it, possibly through the White House's
National Strategic Computing Initiative, launched in July last year.

That agenda, laid out in a report last September, sketches out the
research challenges ahead. Energy efficiency is an urgent priority,
especially for the embedded smart sensors that comprise the ‘Internet
of things’, which will need new technology to survive without batteries,
using energy scavenged from ambient heat and vibration. Connectivity
is equally key: billions of free-roaming devices trying to communicate
with one another and the cloud will need huge amounts of bandwidth,
which they can get if researchers can tap the once-unreachable terahertz
band lying deep in the infrared spectrum. And security is crucial: the
report calls for research into new ways to build in safeguards against
cyberattack and data theft.

These priorities and others will give researchers plenty to work on
in coming years. At least some industry insiders, including Shekhar
Borkar, head of Intel’s advanced microprocessor research, are optimists.
Yes, he says, Moore's law is coming to an end in a literal sense, because
the exponential growth in transistor count cannot continue. But from
the consumer perspective, “Moore's law simply states that user value
doubles every two years”. And in that form, the law will continue as
long as the industry can keep stuffing its devices with new functionality.
The ideas are out there, says Borkar. “Our job is to engineer them.”


M. Mitchell Waldrop is a features editor for Nature. 
Scientists are becoming
increasingly frustrated by 
how long it seems to take 
to publish papers. But is it 
really getting worse? 


THE WAITING GAME 


BY KENDALL POWELL 
When Danielle Fraser first
submitted her paper for 
publication, she had little 
idea of the painful saga that 
lay ahead. 
She had spent some 
18 months studying thousands 
of fossil species spread across 
North America from the past 36 million years, 
and now she had an intriguing result: animal 
populations were spread widest across lati- 
tudes in warm, wet climates. Her work, crucial 
to earning her PhD at Carleton University in 
Ottawa, Canada, might be used to make predic- 
tions about the response of mammals to climate 
change — a key question in ecology today. So, 
with her PhD adviser’s encouragement, she sent 
it to Science in October 2012. 

Ten days later, the paper was rejected with 
a form letter. She sent it to another prestig- 
ious journal, the Proceedings of the National 
Academy of Sciences. Rejected. Next, she tried 
Ecology Letters. Bounced. “At this point, I
definitely was frustrated. I hadn’t even been
reviewed and I would’ve loved to know how to
improve the paper,” recalls Fraser. “I thought,
‘Let's just get it out and go to a journal that will
assess the paper.’”

In May 2013, she submitted the paper to 
Proceedings of the Royal Society B, considered 
a high-impact journal in her field. The journal 
sent it out for review — seven months after her 
initial submission to Science. “Finally!” Fraser 
thought. What she didn’t know was that she had 
taken only the first steps down the long, bumpy 
road to publication: it would take another 
three submissions, two rejections, two rounds 
of major revisions and numerous drafts before 
the paper would finally appear. By that point, 
she could hardly bear to look at it. 

Fraser's frustration is widely shared: research- 
ers are increasingly questioning the time it takes 
to publish their work. Many say that they feel 
trapped in a cycle of submission, rejection, 
review, re-review and re-re-review that seems 
to eat up months of their lives, interfere with job, 
grant and tenure applications and slow down 
the dissemination of results. In 2012, Leslie 
Vosshall, a neuroscientist at the Rockefeller 
University in New York City, wrote a commentary
that lamented the “glacial pace” of scientific
publishing. “In the past three years, if anything,
it's gotten substantially worse,” she says now. “It
takes forever to get the work out, regardless of
the journal. It just takes far too long.”

But is the publication process actually 
becoming longer — and, if so, then why? To find 
out, Nature examined some recent analyses on 
time to publication — many of them performed 
by researchers waiting for their own work to see 
the light of day — and spoke to scientists and 
editors about their experiences. 

The results contain some surprises. Daniel 
Himmelstein, a computational-biology 
graduate student at the University of California,
San Francisco, analysed all the papers
indexed in the PubMed database that had listed
submission and acceptance dates (see go.nature.com/zjdqhn).
His study, done for Nature,
found no evidence for lengthening delays:
the median review time — the time between 
submission and acceptance of a paper — has 
hovered at around 100 days for more than 30 
years (see ‘Paper wait’). But the analysis comes 
with major caveats. Not all journals — includ- 
ing some high-profile ones — deposit such 
time-stamp data in PubMed, and some jour- 
nals show when a paper was resubmitted, rather 
than submitted for the first time. “Resetting the 
clock is an especially pernicious issue,” Him- 
melstein says, and it means that the analysis 
might be underestimating publication delays. 

Some data suggest that wait times have 
increased within certain subsets of journals, 
such as popular open-access ones and some 
of the most sought-after titles. At Nature, the 
median review time has grown from 85 days 
to just above 150 days over the past decade, 
according to Himmelstein’s analysis, and at 
PLoS ONE it has risen from 37 to 125 days over 
roughly the same period. 

Many scientists find this odd, because they 
expect advances in digital publishing and the 
proliferation of journals to have sped things up. 
They say that journals are taking too long to 
review papers and that reviewers are request- 
ing more data, revisions and new experiments 
than they used to. “We are demanding more and
more unreasonable things from each other,” says
Vosshall. Journal editors counter that science
itself has become more data-rich, that they work
to uphold high editorial and peer-review stand- 
ards and that some are dealing with increasing 
numbers of papers. They also say that they are 
taking steps to expedite the process. 

Publication practices and waiting times 
also vary widely by discipline — with social 
sciences being notoriously slow. In physics, the 
pressure to publish fast is reduced because of 
the common practice of publishing preprints 
— early versions of a paper before peer review 
— on the arXiv server. Some of the loudest 
complaints about publication delays come 
from those in biological fields, in which com- 
petition is fierce and publishing in prestigious 
journals can be required for career advance- 
ment. This month, a group of more than 70 sci- 
entists, funders, journal editors and publishers 
are meeting at the Howard Hughes Medical 
Institute campus in Chevy Chase, Maryland, 
to discuss whether biologists should adopt the 
preprint model to accelerate publishing. “We
need a fundamental rethinking of how we do 
this,” Vosshall says. 


THE PITCH 

In March 2012, Stephen Royle, a cell biologist 
at the University of Warwick, UK, started on 
a publication mission of his own. His latest 
work answered a controversial question about 
how cells sense that chromosomes are lined 
up before dividing, so he first sent it to Nature 
Cell Biology (NCB), because it is a top journal 
in his field and an editor there had suggested 
he send it after hearing Royle give a talk. It 
was rejected without review. Next, he sent it to 
Developmental Cell. Rejected. His next stop, the 
Journal of Cell Biology, sent the paper out for 
review. It came back with a long list of necessary 
revisions — and a rejection.

Royle and his lab spent almost six months 
doing the suggested experiments and revis- 
ing the paper. Then he submitted the updated 
manuscript to Current Biology. Rejected. EMBO 
Journal. Reviewed and rejected. 

Finally, in December 2012, he submitted it to 
the Journal of Cell Science (JCS), where it was 
reviewed. One reviewer mentioned that they 
had already assessed it at another journal and 
thought that it should have been published 
then. They wrote that the work was “beautifully
conducted, well controlled, and conservatively
interpreted”. A second reviewer said that
it should not be published. The editor at JCS 
decided to accept it. The time between first sub- 
mission to Nature Cell Biology and acceptance 
at JCS was 317 days. It appeared online another 
53 days later. The work went on to win the JCS
prize as the journal's top paper for 2013. 

Despite the accolade, Royle says that the 
multiple rejections were demoralizing for his 
student, who had done the experiments and 
needed the paper to graduate. He also thinks 
that the paper deserved the greater exposure 
that comes from publication in a more prestig- 
ious journal. “Unfortunately, the climate at the 
moment is that if papers aren't in those very top 
journals, they get overlooked easily,” he says.
And Royle, who has done several publication- 
time analyses and blogged about what he found, 
has shown that this experience is not unusual. 
When he looked at the 28 papers that his lab had 
published in the previous 12 years, the average 
time to gestate from first submission to publi- 
cation was the same as a human baby — about 
9 months (see go.nature.com/79h2n3). 

But how much of these delays were his own 
doing? To publish the chromosome paper, 
Royle indulged in the all-too-familiar practice 
of journal shopping: submitting first to the most 
prestigious journals in his field (often those with 
the highest impact factor) and then working his 
way down the hierarchy. (Nature Cell Biology’s 
current impact factor is 19; JCS’s is 5.) Journal 
impact factor or reputation are widely used by 
scientists and grant-review and hiring committees
as a proxy for the quality of the paper. On
the flip side, critics say that editors seek out the
splashiest papers to boost their publication's
impact factor, something that encourages journal
shopping, increases rejection rates and adds
to the wait time. Journal editors reject this; Ritu
Dhand, Nature editorial director in London,
says that Nature’s policy of selecting original,
important work “may lead to citation impact
and media coverage, but Nature editors aren't
driven by those considerations”.

[Figure ‘Paper wait’, panel REVIEW TIME: An analysis of all papers in PubMed up to 2015 with listed submission and acceptance dates suggests that the median time from submission to acceptance has hovered at around 100 days, although it has gone up at some journals. A few journals with quick review times started in 2000, lowering the median. An analysis of PubMed papers published in 2013 suggests that journals with the lowest and highest impact factors have the longest review times.]

How much time does journal shopping add? 
In the analysis of his group's research papers, 
Royle found that more than half were shopped 
around, and that this consumed anywhere from 
a few days to more than eight months. He went 
on to analyse all the papers published in 2013 
that are indexed in PubMed, and examined 
whether higher impact factor correlated with 
longer median publication times. He found an 
inverted bell-shaped curve — the journals with 
the lowest and highest impact factors had longer 
review times than did those in the middle. For 
the vast majority of those in the middle, review 
times stood at around 100 days — matching 
Himmelstein’s analysis. Those with the very 
highest impact factors (30-50) had a review 
time of 150 days, supporting the idea that 
pitching a paper to a series of top journals could 
result in significant delays in publication. 

Many scientists, editors and publishers 
have long acknowledged that journal name is 
a flawed measure of the quality and value of 
a piece of research — but the problem shows 
no signs of going away. “Where your paper is 
published doesn’t say anything about you, your 
paper’s impact or whether it’s right or wrong,” 
says Maria Leptin, director of EMBO, an
organization of Europe's leading life scientists
and publisher of the EMBO Journal. “Nobody
has the courage to say, we, as a funding organization,
or we, as a tenure committee, are not
going to look at where you publish as opposed
to what you publish.”

[Figure ‘Paper wait’: Some scientists complain that publishing papers takes too long, but data show a complex picture. The same analysis of PubMed papers suggests that the time between acceptance and publication has dropped, probably because technology has improved. An analysis of biology papers published in 3 journals showed that the number of panels in experimental figures jumped between 1984 and 2014, a hint that the amount of data per paper is increasing.]

And the obsession with prestigious journals
is just one source of delay — as Fraser, who was
battling to publish her paper on ancient animal
populations, was about to find out.


PEER REVIEW 

By October 2013, a full year had passed since 
Fraser had first submitted her paper to a jour- 
nal, and she had pretty much stopped caring 
about impact factor. By this point, the paper 
had spent two months in review at Proceedings 
of the Royal Society B, before coming back with 
mixed reports — and a rejection. So Fraser 
decided to try PLoS ONE, a journal that says it 
will publish any rigorous science, regardless of 
its significance, scope or anticipated citations. 
It has an impact factor of 3, and a reputation for
rapid publication. 

PLoS ONE sent the paper out to a single 
reviewer. Two months later, Fraser got a deci- 
sion letter that essentially stated that the paper 
was rejected but might be eligible for re-review 
if the suggested revisions were made. She made 
the revisions, adding citations and a small 
amount of reanalysis. In March 2014, she resub- 
mitted the manuscript, which PLoS ONE sent 
out to a different reviewer. Another two months
passed before she received the new review: 
major revisions, please. 

“I'm just happy they didn't tell me to go away,”
recalls Fraser. “I do have e-mails from the time
that say, ‘1-millionth draft!’” She persevered,
making more revisions to meet the reviewer's 
demands, and in June 2014 submitted the 
paper to PLoS ONE for a third time. Success! 
The paper was published online 23 months
after she had first sent it to Science. The long 
peer-review and revision process did improve 
the paper, Fraser says now. “It was really much
better.” But did the main conclusion of the paper
change? “Not really.”

Last year, Chris Hartgerink, a behavioural- 
sciences graduate student at Tilburg University 
in the Netherlands, ran an analysis of the Public 
Library of Science (PLOS) family of journals 
since the first one launched in 2003. (He chose 
the journals largely because they make the data 
easily accessible, and because he was waiting for 
a paper to be published in PLoS ONE.) He found 
that the mean review time had roughly doubled 
in the past decade, from 50-130 days to 150-250 
days, depending on the journal (see go.nature.com/s3voeq).
And when Royle looked at eight
journals that had published cell-biology papers 
over the past decade, he found that publication 
times had lengthened at seven of them, mostly 
because review times had stretched out. 

One contention is that peer reviewers now 
ask for more. When Ron Vale, a cell biologist 
at the University of California, San Francisco, 
analysed biology papers that had been published
in Cell, Nature and the Journal of Cell Biology
in the first six months of 1984 and compared
them with those from the same period in 2014, he found
that both the average number of authors and
the number of panels in experimental figures
rose 2–4-fold. This showed, he argued, that
the amount of data required for a publication 
had gone up, and Vale suspects that much of the
added data come from authors trying to meet
reviewers' demands. Scientists grumble about
overzealous critics who always seem to want
more, or different, experiments to nail a point.
“It’s very rare for the revisions to fundamentally
change a paper — the headline doesn't change,”
Royle says. His analysis of his group’s publication
times showed that almost 4 months of the
average 9-month gestation was spent revising 
papers for resubmission. 

Many scientists also blame journal editors, 
who, they say, can be reluctant to provide clear 
guidance and decisions to authors when reviews 
are mixed — unnecessarily stringing out the 
review and revision process. Journal heads disa- 
gree, and say that their editors are accomplished 
at handling mixed reviews. Cell editor-in-chief 
Emilie Marcus in Cambridge, Massachusetts, 
says that editors at her journal take responsibil- 
ity for publication decisions and help authors to 
map out a plan for revisions. 

Technological advances mean that research 
now involves handling more and more data, 
editors say, and there is greater emphasis on 
making that information available to the com- 
munity. Marcus says that her journal is working 
to cut review times by, for example, increasing 
the number of papers that go through only one 
round of revision — 14% of their papers did so 
in 2015. In 2009, Cell also restricted the amount 
of supplemental material that could accompany 
papers as a way to keep requests for “additional, 
unrelated experiments” at bay. 

PLOS executive editor Veronique Kiermer, 
based in San Francisco, declined to discuss the 
specifics of Fraser’s paper, but she called its total 
review time of nine months an “outlier” and said 
that it was “not ideal to have research being evaluated
by a single person”. She acknowledges that
PLoS ONE’s publication time has risen; one fac- 
tor is that the volume of papers has, too — from 
200 in 2006 to 30,000 per year now — and it 
takes time to find and assign appropriate editors 
and reviewers. (PLOS used 76,000 reviewers 
in 2015.) Another, says Kiermer, is that the
number of essential checkpoints — including
competing-interest disclosures, animal-welfare
reports and screens for plagiarism — has
increased in the past decade. “We'll do everything
we can both in terms of technology and
looking at workflows to bring these times to
publication down,” she says.

Dhand says that at Nature, too, editors find it 
harder to find reviewers than in the past, “pre- 
sumably because there are so many more papers 
that need reviewing”. Himmelstein found that 
the number of papers in PubMed more than 
doubled between 2000 and 2015, reaching 
nearly 1 million articles. 


TECHNOLOGY ADVANCES 

Digital publishing may have had benefits in 
shortening ‘production’ time — the time from 
acceptance to publication — rather than time 
in review. In Himmelstein’s analysis, time spent
in production has halved since the early 2000s,
falling to a stable median of 25 days. 

Several new journals and online publishing 
platforms have promised to speed up the process
even more. PeerJ, a family of journals that
launched in 2013, is one of several that now 
encourage open peer review, in which review- 
ers’ names and comments are posted alongside 
articles. The hope is that the transparency will 
prevent unnecessary delays or burdensome 
revision requests from reviewers. 

The biomedical and life-sciences journal 
eLife launched in 2012 with a pledge to make 
initial editorial decisions within a few days and 
to review papers quickly. Reviewers get strict 
instructions not to suggest the ‘perfect experiment’,
and they can ask for extra analysis only
if it can be completed within 2 months. Other- 
wise, the paper is rejected. Randy Schekman, 
a cell biologist at the University of California, 
Berkeley, and editor-in-chief of eLife, says that 
these policies mean that more than two-thirds 
of the journal's accepted papers undergo just 
one round of review. 

In a 2015 analysis, Himmelstein created 
a ranking by the median review time for all 
3,482 journals that had papers with time stamps 
in the PubMed database from January 2014 to 
June 2015 (see go.nature.com/sscrr6). PeerJ
had a relatively fast time: 74 days after submis- 
sion. At eLife, it took 108 days, and PLoS ONE 
took 117. By comparison, Cell’s review time was 
127 days; Nature’s was 173 days; PLoS Medicine 
took 177 days; and Developmental Cell was 
among the slowest of the popular biomedical 
journals, at 194 days. Marcus notes that com- 
parison between journals is difficult because 
the publications define received, revised and 
accepted dates differently, and that Developmental
Cell places a high priority on timely review.


PREPRINTS RECONSIDERED 

One way for biologists to accelerate publication 
is by embracing preprints. These allow work to 
quickly receive credit and critique, says Bruno 
Eckhardt, associate editor of Physical Review E 
and a theoretical physicist at the University of 
Marburg in Germany. “It is almost like going 
on Facebook — it means you are ready to go 
public,” he says. A preprint submitted to bioRxiv
— a server run by Cold Spring Harbor Laboratory
in New York — is published online within
24 hours and given a digital object identifier
(DOI); subsequent revisions are time-stamped
and anyone can read and comment on the
paper. “The minute a research story gets into 
the public domain, it benefits from the collective 
power of different brains looking at a problem,” 
says Vale. What's more, proponents say, preprint 
publishing can simply be added onto the con- 
ventional publication process. F1000Research, 
which launched in 2012, does this by publish- 
ing papers first, then inviting open peer review 
and revision. 

Some scientists are going a step further, and 
using platforms such as GitHub, Zenodo and 
figshare to publish each hypothesis, data collec- 
tion or figure as they go along. Each file can be 
given a DOI so that it is citable and trackable.
Himmelstein, who already publishes his papers 
as preprints, has been using the Thinklab plat- 
form to progressively write up and publish the 
results of a new project since January 2015. “I 
push ‘publish’ and it gets a DOI with no delay,”
he says. “Am I really gaining that much by pub- 
lishing [in a conventional journal]? Or is it bet- 
ter to do what is fastest and most efficient to get 
your research out there?” 

But preprints and real-time digital publishing 
platforms are no panacea. Vosshall says that 
many biologists are “terrified” of preprints 
because they fear getting scooped by competi- 
tors or losing credit and intellectual-property 
rights for their ideas. And even after preprint 
publishing, scientists can still find themselves 
slogging through peer review and chasing high- 
impact journals for a final publication to adorn 
their CV. Vosshall says that the scientific com- 
munity relies on conventional journals to serve 
as a ‘prestige filter’ so that important papers 
are brought to the attention of the right read- 
ers. Without them, “How do we find the good 
stuff?” she asks. 

For Fraser, her PLoS ONE publication proved 
a success. When the paper was finally published 
after its almost-two-year wait, she got positive 
responses, she says. It has been viewed nearly 
2,000 times, had 51 shares on Facebook and 
Twitter and got 280 downloads. The publication 
also helped her to secure her current position 
— as a postdoctoral fellow at the Smithsonian
Institution Museum of Natural History in 
Washington DC. “I pretty much got the top 
postdoc that I could have gotten.”

Still, the whole process is not something she 
wants to endure again — so these days, she tends 
to send her papers to mid-range journals that 
are likely to publish her work right away. “If my
ultimate goal is to get a faculty job, I can't afford
to wait two years on a single paper,” she says.


Kendall Powell is a writer based in Lafayette, 
Colorado. 
COMMENT

[Image: Carbon-capture plants provide one way to reduce the amount of atmospheric carbon dioxide. Credit: Rogelio V. Solis/AP]


Scrutinize CO₂
removal methods


The viability and environmental risks of removing 
carbon dioxide from the air must be assessed if we are 
to achieve the Paris goals, writes Phil Williamson. 


In Paris last December, the 196 parties
to the United Nations Framework Convention
on Climate Change (UNFCCC)
agreed to balance the human-driven
greenhouse-gas budget some time between
2050 and 2100. This commitment is intended
to limit the increase in global average temperature
above pre-industrial levels to “well
below 2°C” — and preferably to 1.5°C.
A balanced greenhouse-gas budget either
requires that industry and agriculture
produce zero emissions or necessitates the
active removal of greenhouse gases from
the atmosphere (in addition to deep and
rapid emissions cuts). In most modelled
scenarios that limit warming to 2°C, several
gigatonnes of carbon dioxide have to be
extracted and safely stored each year. For
more ambitious targets, tens of gigatonnes
per year must be removed.


at the scale needed to deliver the goal of the 
Paris agreement depends on three things: 
feasibility, cost and acceptability. A crucial 
component of all of these approaches is the 
non-climatic impacts that large-scale CO₂
removal could have on ecosystems and
biodiversity. 

Until now, the UNFCCC’s scientific advi- 
sory body, the Intergovernmental Panel on 
Climate Change (IPCC), has paid relatively 
little attention to such impacts. It has fallen 
to other groups to review insights and gaps in 
our understanding of the influence of CO₂-removal
techniques on ecology; to make
broad assessments of climate-engineering
schemes; and to carry out comparative
modelling studies.

It is time for the IPCC, governments and 
other research-funding agencies to invest in 
new, internationally coordinated studies to 
investigate the viability and relative safety of 
large-scale CO₂ removal.


FRONT-RUNNERS 

Since its establishment in 1988, the IPCC has 
predominantly involved physical scientists 
and modellers, rather than ecologists. This, 
combined with the only relatively recent 
evidence that emissions reduction alone is 
unlikely to avert dangerous climate change, 
could account for why the IPCC’s roughly 
5,000-page Fifth Assessment Report, 
released in 2013 and 2014, leaves out one 
crucial consideration: the environmental 
impacts of large-scale CO₂ removal.

This omission is striking because the set of 
IPCC emissions scenarios that are likely to 
limit the increase in global surface tempera- 
ture to 2°C by 2100 (the aim of the RCP2.6 
‘representative concentration pathway’, the
IPCC climate-change-response scenario that 
achieves the lowest emissions) mostly relies 
on large-scale CO₂ removal.

These scenarios assume that two tech- 
niques could be developed to balance the 
carbon budget later this century: bioenergy 
with carbon capture and storage (BECCS), 
and afforestation. BECCS involves grow- 
ing bioenergy crops, from grasses to trees; 
burning them in power stations; stripping 
the CO₂ from the resulting waste gases;
and compressing it into a liquid for underground
storage. Afforestation — planting
trees — also relies on photosynthesis to
initially remove CO₂ from the atmosphere.
Storage is achieved naturally, in timber and
soil.

TAKE YOUR PICK

A plethora of schemes have been proposed to
extract carbon dioxide from the atmosphere.
Here are nine, some more speculative than
others.

TECHNIQUE: HOW IT WORKS

Bioenergy with carbon capture and storage (BECCS): Crops grown for the purpose are burnt in power stations (providing energy), and the resulting CO₂ is captured for secure long-term storage.

Afforestation and reforestation: Large-scale tree plantations increase natural storage of carbon in biomass and forest soil.

‘Blue carbon’ habitat restoration: The recovery of degraded or over-exploited coastal ecosystems that have a high potential for carbon storage, such as saltmarshes and mangroves.

Biochar: Carbon from partly burnt biomass is added to soil, with potential for agricultural benefits.

Enhanced ocean productivity: Marine photosynthesis and CO₂ drawdown from the atmosphere is increased, either by adding nutrients to promote phytoplankton growth in the open ocean or through seaweed cultivation in shallow seas.

Enhanced weathering (using silicate rock): Crushed olivine or other silicate rocks are added to soil surfaces or the ocean for chemical absorption of CO₂. (Could help to reduce ocean acidification.)

Direct air capture (DAC): Chemicals (or possibly low temperatures) are used to extract CO₂ from ambient air. Safe CO₂ transport and storage are subsequently required.

Cloud treatment to increase alkalinity: Alkaline rain resulting from cloud treatments reacts with, and removes, atmospheric CO₂.

Building with biomass: A massive increase in the use of biomass (straw and timber) as a building material removes carbon for decades or centuries.

Limiting the global temperature rise to 
2°C, with any confidence, would require 
the removal of some 600 gigatonnes of CO₂
over this century (the median estimate of
what is needed). Using BECCS, this would
probably require crops to be planted solely
for the purpose of CO₂ removal on between
430 million and 580 million hectares of land 
— around one-third of the current total 
arable land on the planet, or about half the 
land area of the United States. 

Unless there are remarkable increases in 
agricultural productivity, greatly exceeding 
the needs of a growing global population, 
the land requirements to make BECCS work 
would vastly accelerate the loss of primary 
forest and natural grassland. Thus, such 
dependence on BECCS could cause a loss of 
terrestrial species at the end of the century 
perhaps worse than the losses resulting from 
a temperature increase of about 2.8 °C above 
pre-industrial levels.

A more fundamental concern is whether
BECCS would be as effective as it is widely
assumed to be at stripping CO₂ from the
atmosphere. Planting at such scale could
involve more release than uptake of greenhouse
gases, at least initially, as a result of land
clearance, soil disturbance and increased
use of fertilizer. When such effects are taken
into account, the maximum amount of CO₂
that can be removed by BECCS (under the
RCP2.6 scenario) is estimated to be 391 gigatonnes
by 2100. This is about 34% less than
the median amount assumed to be needed to
keep the temperature rise below 2°C. If less
optimistic but not unrealistic assumptions
are made about where the land for bioenergy
crops would come from, a net release of
135 gigatonnes of CO₂ could occur by 2100
(see ‘Future unknown’).

Incomplete understanding throws other 
assumptions of the BECCS-based scenarios 
into question. For instance, little is known
about the effect of future climatic conditions 
on the yields of bioenergy crops; what the 
water requirements of such crops may be in 
a warmer world; the implications for food 
security if bioenergy production directly 
competes with food production; and the 
feasibility (including commercial viability) 
of the associated carbon capture and storage 
infrastructure. 

Less is expected of afforestation in 
terms of its ability to take CO₂ out of the
atmosphere. Yet there is a near-universal 
assumption that increased forest cover is 
environmentally desirable. This is true in 
most cases of reforestation, particularly if 
a mixture of native trees is planted or re- 
planted, rather than an exotic monoculture. 
But afforestation can also involve the loss of 
natural ecosystems. And planting swathes of
forest will cause complex changes in cloud 
cover, albedo (reflectance) and the soil- 
water balance (through changes to evapo- 
ration and plant transpiration), all of which 
affect Earth’s surface temperature. 

Counter-intuitively, afforestation at 
mid-latitudes and in northern, boreal for- 
ests may have a net warming effect, despite 
increasing the storage of carbon. Also, as
with bioenergy crops, it is difficult, if not 
impossible, to reliably quantify the effects 
of future climate change during 2050-2100. 
Increased fires, droughts, pests and disease 
could jeopardize the stability of carbon stor- 
age in newly planted forests. 


OTHER OPTIONS 

There is no shortage of other ideas for CO₂
removal by biological, geochemical and
chemical means (see ‘Take your pick’). For
all such schemes, modelling the theoretical 
potential of a proposed approach can give 
a completely different picture from that 
obtained when environmental impacts — 
not to mention practicalities, governance 
and acceptability — are considered. 

The roughly 25 years of discussion, 
research and policymaking on ocean
fertilization, another CO₂-removal technique, is
a case in point. Since the link was first made 
between natural changes in the input of dust 
to the ocean, ocean productivity and cli- 
matic conditions, there has been a dramatic 
scaling-down of expectations of how effec- 
tive ocean fertilization might be as a way to 
avoid human-driven global warming.

During the 1990s, researchers postulated 
that for every tonne of iron added to seawater,
tens of thousands of tonnes of carbon
(and hence CO₂) could be fixed by the
resulting blooms of phytoplankton. This
quantity has been whittled down over
the years with the realization that most
of the CO₂ absorbed
by such blooms — stimulated either by add- 
ing iron or other nutrients to seawater, or by 
enhancing upwelling through mechanical 
means — would be released back into the 
atmosphere when the phytoplankton decom- 
posed. Moreover, a large-scale increase in 
plankton productivity in one region (across 
the Southern Ocean, say) could reduce the 
yields of fisheries elsewhere by depleting 
other nutrients, or increase the likelihood of 
mid-water deoxygenation. Such risks have 
resulted in the near-universal rejection of 
ocean fertilization as a climate intervention, 
through bodies such as the Convention on 
Biological Diversity (CBD).

FUTURE UNKNOWN
Projections of how much carbon dioxide could be removed from the atmosphere using bioenergy
with carbon capture and storage (BECCS) between 2020 and 2100 vary drastically depending on
where the land for growing bioenergy crops comes from.
[Figure: bar chart comparing the IPCC projection of the CO₂ removal needed to keep the
temperature rise below 2°C (above pre-industrial levels) by 2100 with three BECCS scenarios:
bioenergy crops grown on abandoned land only; some forests converted to cropland, with no
increase in agricultural productivity; and large areas of forest converted to cropland, with
agricultural productivity increases. Horizontal axis: amount of atmospheric CO₂ removed or
added by 2100 (gigatonnes), from −700 to 200. Source: ref. 8.]

More recently, other, potentially more
controllable, ocean-based CO₂-removal
techniques have been suggested, such as the
cultivation of seaweed to cover up to 9% of
the global ocean. The specific environmental
implications of this method have yet to be
assessed. Yet such an approach would clearly 
affect, and potentially displace, existing 
marine ecosystems that have high economic 
value. (Shallow and coastal waters currently 
provide around 90% of global fish catches.) 

Back on land, other techniques include 
those to increase the amount of carbon 
sequestered in the soil, for example by plough- 
ing in organic material such as straw, reducing 
ploughing (to limit soil disturbance) or add- 
ing biochar (a form of charcoal). Another idea 
is to enhance weathering, which involves the 
absorption of CO₂ from the atmosphere by
certain silicate rocks. Existing insights from
agriculture, geoscience and mineral extraction
enable more informed assessments of
the feasibility and acceptability of these
approaches. Yet it is crucial to know more
about the permanence of carbon storage for
biologically based methods, and the environmental
impacts that might result if such
approaches are used at vast scale.

For example, the use of biochar raises 
land-use issues. In addition, millions of 
hectares of soil darkened by the application 
of biochar would decrease albedo, increas- 
ing heat absorption. The addition of pul- 
verized rock to the soil surface, by contrast, 
would increase reflectivity. Yet to reduce 
the amount of CO₂ in the atmosphere by
around 50 parts per million (a roughly 12%
decrease from current levels), 1–5 kilograms
per square metre of silicate rock would need
to be applied each year to 2 billion to 6.9 billion
hectares of land (15–45% of Earth’s
land surface area), mostly in the tropics.
The volume of rock mined and processed 
would exceed the amount of coal currently 
produced worldwide, with the total costs of 
implementation estimated to be between 
US$60 trillion and $600 trillion. And the
chemistry and biology of rivers and adja- 
cent ocean areas would be radically altered. 
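
As a rough check on the scale involved, the quoted application rates and land areas can be multiplied out. The sketch below is illustrative arithmetic only: the function name and constants are hypothetical, it is not taken from the cited study, and it uses nothing beyond the ranges given above (1 to 5 kilograms per square metre, applied to 2 billion to 6.9 billion hectares).

```python
# Illustrative arithmetic only; the input ranges are those quoted in the text.
M2_PER_HECTARE = 10_000      # 1 hectare = 10,000 square metres
KG_PER_GIGATONNE = 1e12      # 1 gigatonne = 10^12 kilograms


def rock_mass_gt_per_year(kg_per_m2: float, hectares: float) -> float:
    """Annual mass of applied rock, in gigatonnes (hypothetical helper)."""
    return kg_per_m2 * hectares * M2_PER_HECTARE / KG_PER_GIGATONNE


# Lowest rate on the smallest area, highest rate on the largest area.
low = rock_mass_gt_per_year(1, 2e9)
high = rock_mass_gt_per_year(5, 6.9e9)

print(f"{low:.0f} to {high:.0f} gigatonnes of rock per year")
```

Under these assumptions the requirement works out to roughly 20 to 345 gigatonnes of rock each year, which gives a sense of why the mining and processing effort is described as exceeding current coal production.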

The most environmentally benign option 
for large-scale CO₂ removal may be direct air
capture (DAC). This can be done by passing
air through anion-exchange resins that contain
hydroxide or carbonate groups, which
absorb CO₂ when dry and release it when
moist. The extracted CO₂ can then be compressed,
stored in liquid form and deposited
underground using carbon capture and storage
technologies.

The operational costs for DAC cover 
a similar range to those estimated for 
enhanced weathering. The extraction pro- 
cess would also need land and probably 
water, and, as for BECCS, there is a risk of 
CO₂ leaking out of geological reservoirs.
Such risks can be minimized by storing the
liquid CO₂ beneath the sea or by using geochemical
transformation, which involves
in situ reactions between CO₂ and certain
rock types. In theory, cooling (rather than
chemistry) could also be used to liquefy
CO₂ out of ambient air.
The technical feasibility, costs and potential 
environmental impacts of this approach 
— which could involve setting up plants in 
remote places such as Antarctica — have yet 
to be investigated. 


URGENT ACTION 
As well as a major step up in research, urgent 
attention must be given to clarification at the 
UN level of what is considered geoengineering
and what is climate mitigation. The two were
once considered distinct approaches, but the meaning
of these terms has become fuzzier in recent
years. CO₂ removal is frequently included in
both categories, generating confusion and 
contradiction. 

This is crucial to resolve because mitiga- 
tion and geoengineering have very different 
psychological connotations. Mitigation is
universally considered to be a good thing
that reduces risk or damage. Geoengineering
frequently elicits suspicion, or is dismissed
as a ‘high-risk, high-tech’ approach that may
itself be harmful.

CO₂ removal was not specifically discussed
in Paris. However, the large-scale
extraction of CO₂ does seem to be a requirement
to meet the goal of the Paris agreement.
The CBD considers most, if not all, techniques
for CO₂ removal to be climate geoengineering,
which it has repeatedly rejected
as a policy option for addressing climate
change. With a few exceptions, the same 195
or so governments make up both the UNFCCC
and the CBD.

One solution would be to abandon the 
term climate geoengineering and simply 
assess the various methods for mitigating 
climate change on a case-by-case basis. 

The Paris agreement shows where we want 
to go — the brave new world of a balanced 
carbon budget — but not how to get there. For 
now, action should focus on urgent emissions 
reductions and not on an unproven ‘emit now, 
remove later’ strategy. But the unwelcome 
truth is that, unless a lot more effort is made 
to cut emissions, significant CO₂ removal
will need to begin around 2020, with up to
20 gigatonnes of CO₂ extracted each year by
2100 to keep the global temperature increase
“well below 2°C”.

Is that feasible? What environmental risks 
and constraints are involved? We need to 
know.


Phil Williamson is a science coordinator 
for the Natural Environment Research 
Council and an associate fellow in the School 
of Environmental Sciences at the University 
of East Anglia in Norwich, UK. 

e-mail: p.williamson@uea.ac.uk 
COMMENT


Shareholders must vote for 
climate-change mitigation 


Investors who are standing idly by as emissions erode the value of their stock could 
find themselves in court, warn Howard Covington and colleagues. 


The largest 500 companies listed on
the world’s stock markets account
for about half of market value and
14% of global emissions. Steel company
ArcelorMittal, utilities firm RWE and oil giant 
ExxonMobil are among the top ten. All listed 
companies probably account for roughly one- 
quarter of global emissions. Many of these 
companies are multinationals, and are not 
party to the climate deal made in Paris last 
December, so they could resist or even chal- 
lenge government plans to reduce emissions. 
One way to encourage their cooperation 
would be for their shareholders to propose 
and vote for resolutions aimed at increas- 
ing or preserving stock value while reducing 
emissions. Why would shareholders do this? 
Senior lawyers have concluded that those who 
manage other people's money have a duty to 
control for ‘material risks’. In finance, that 
means risks that might trigger a 5% or more 
loss in investment value. Climate damage in 
the future is expected to be one such risk. 
Therefore, we (an investment manager, an 
environmental lawyer and a climate econo- 
mist) believe that clients and beneficiaries of 
investment firms might have a legal case to 
bring against their investors who stand idly by 
as emissions erode the value of their stock. We 
are researching and designing such actions. 


CASE LAW 
Legal rulings about duties can have far-reach- 
ing consequences. In June 2015, a Dutch court 
found that the government has a duty of care 
to its citizens to minimize the risk of climate 
change, and ordered the Dutch government 
to go beyond its current plan to reduce the 
country’s greenhouse-gas emissions. Pend- 
ing an appeal hearing, the government is now 
working towards a 25% reduction by 2020. 
In another case last year, brought
by ClientEarth (where one of us, J.T., works),
the Supreme Court of the United Kingdom 
found that the British government has a duty 
to obey its own air-quality laws. Because this 
case involved the European Court, the rulings 
of which bind all European Union member 
states, ClientEarth is preparing a slew of par- 
allel legal actions in other EU countries. Ten 
such cases were filed in Germany last Novem- 
ber. One has already reached judgment, 
resulting in a court order to the city of Wiesbaden
to take all necessary action to clean up
its air. The Chinese judiciary has sought views
from ClientEarth on how legal action against 
emitters could help to address China’s air and 
other pollution challenges. 

Systemic climate risk for investors can be 
expressed in terms of the reduction in value, 
due to warming this century, of a diversified
investment portfolio at a given probability.
This is called a climate value at risk. If economic
output is expected to be damaged
by future warming, then portfolio value
may be lower than it otherwise would
be. Economic damage could accelerate
if extreme weather resulting
from high levels of warming creates
regional instabilities and causes
large-scale migration by making populated
areas uninhabitable.

Assessment by the investment commu- 
nity of climate value at risk is still at an early 
stage. Initial estimates point to a value at
risk of 10% or so at probabilities of loss that 
are within the range typically used in finan- 
cial management. The total value of the 
shares listed on the world’s equity markets 
is around US$70 trillion, suggesting that the 
potential loss from a plausible worst case is 
currently roughly $7 trillion. This is a meas- 
ure of the systemic climate risk to the world’s 
equity markets. 




MANAGING RISK 

This risk exceeds the legal test of materiality 
and should be too large to ignore. In practice, 
most investors neglect it entirely. 
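
The arithmetic behind this comparison can be written out in a few lines. The sketch below is a minimal illustration using only the figures quoted above (a 5% materiality threshold, a value at risk of about 10% and equity markets worth about US$70 trillion); the names are hypothetical and it is not a model of how value at risk is actually estimated.

```python
# Figures quoted in the text; all variable and function names are illustrative.
MATERIALITY_THRESHOLD = 0.05              # a loss of 5% or more counts as material
EQUITY_MARKET_VALUE_TRILLION_USD = 70.0   # approximate value of listed shares
CLIMATE_VALUE_AT_RISK = 0.10              # initial estimate, "10% or so"


def potential_loss(market_value: float, value_at_risk: float) -> float:
    """Loss implied by applying a value-at-risk fraction to a portfolio value."""
    return market_value * value_at_risk


def is_material(value_at_risk: float,
                threshold: float = MATERIALITY_THRESHOLD) -> bool:
    """True if the fractional loss meets or exceeds the materiality threshold."""
    return value_at_risk >= threshold


loss_trillion = potential_loss(EQUITY_MARKET_VALUE_TRILLION_USD,
                               CLIMATE_VALUE_AT_RISK)
material = is_material(CLIMATE_VALUE_AT_RISK)

print(f"Potential loss: about ${loss_trillion:.0f} trillion")
print(f"Meets the materiality test: {material}")
```

With these inputs the potential loss is about $7 trillion, and a 10% value at risk is twice the 5% threshold.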

What should money managers be doing 
about systemic climate risk? Because it arises 
from damage to the economy as a whole, the 
risk cannot be reduced by hedging invest- 
ments or by limiting exposure to particular 
assets. Investors should actively encourage 
the companies they own to reduce emissions, 
for example by urging profitable investment 
in energy efficiency or emissions-reducing 
changes to processes and by discourag- 
ing risky capital expenditure on fossil-fuel 
exploration and production. 

Better still, managers could push compa- 
nies to adopt the goals of the Paris agreement 
into their constitutions and to publish
business plans for how they will deal with the 
transition to a zero-carbon economy. 

A few forward-looking investors are 
already acting in this way and are beginning 
to have an effect. Last year, CCLA, a British 
investment firm, led the filing of shareholder 
resolutions requiring greater emissions 
disclosure from Shell and BP. This was fol- 
lowed by BHP Billiton, a mining company, 
responding to concerned investors by pro- 
ducing an analysis of how its business would 
operate in an emissions-constrained world. 
These are useful steps, but they are small. 

To produce a wholesale change in attitude, 
a court ruling on the obligations of fiduciary 
investors to control systemic climate risk will 
probably be needed. Because of the uncertain- 
ties in estimating future climate damage, this 
will not be an easy case to bring. But we antici- 
pate that such a case will ultimately succeed. 
We are seeking more and better quantitative 
assessments of systemic climate risk and, in 
particular, the risk to investment portfolios, to 
help to clarify the relevant obligations. 

The Paris agreement was a diplomatic 
triumph. But it will remain largely volun- 
tary until countries translate the promises 
into their own legal systems. Then it will take 
constant pressure to ensure implementation 
and prevent backsliding. Investors will play 
a major part, either voluntarily or because 
they will be forced by the courts to meet their 
legal obligations to manage climate risk.
JAN VILCEK'S PERSONAL ARCHIVES


Jan Vilcek at medical school in Bratislava in the 1950s. 


DRUG DISCOVERY 


A life of tumult 
and triumph 


Marian Turner reviews the memoir of emigre virologist 
and millionaire philanthropist Jan Vilcek. 


Medical students in communist
Czechoslovakia in the early 1950s
were taught that Mendelian genetics
was “bourgeois pseudoscience”. The theory
of heredity that toed the party line came from 
Soviet agronomist Trofim Lysenko: Stalin 
championed his idea that acquired traits 
could be inherited. Jan Vilcek was one of 
several students of the time who saw through 
this nonsense, but kept quiet. The budding 
virologist already had two decades of experi- 
ence of staying under the authoritarian radar 
— a frustration that would later prompt him
to break free in spectacular fashion.

In Love and Science, Vilcek traces that 
tumultuous, ultimately triumphant journey. 
He takes us from his birth in 1933, to Jewish 
parents in Bratislava (in what is now Slo- 
vakia), to his current position of president 
and co-founder, with his art-historian wife 
Marica, of the Vilcek Foundation in New
York City. This multimillion-dollar
philanthropic enterprise uses Vilcek’s royalties
from the autoimmune drug Remicade (infliximab)
to honour the contribution of
immigrant scientists and artists to US society.

Vilcek begins with how he discovered the 
drug. He cut his research teeth on interferons 
— proteins produced in response to infec- 
tion — at the Bratislava Institute of Virol- 
ogy in the early 1960s. But it was in 1988 at 
New York University’s School of Medicine 
that the main event occurred: the develop- 
ment of a monoclonal antibody against the 
inflammation-inducing molecule tumour 
necrosis factor (TNF). As Vilcek and others 
revealed, people with rheumatoid arthritis 
had high levels of TNF. Remicade was the
first TNF-blocking agent approved by the US 
Food and Drug Administration, initially for 
treating Crohn’s disease, in 1998. Today, it 
is used against multiple autoimmune con- 
ditions, including rheumatoid arthritis and 
psoriasis, and remains one of the world’s five 
top-selling prescription drugs. 

Vilcek hymns the importance to medicine 
of fundamental science, serendipity, colle- 
giality and some risk-taking. He supports 
funding for translational medicine, but 
cautions that without continued support 
for basic research, there might be nothing 
to translate. However, the book’s early detail 
about science, clinical trials and intellectual- 
property processes behind Remicade could 
be intimidating to those who picked up the 
autobiography for its personal tale. Love and 
Science becomes more engaging in its second 
section, when it steps back to the memories 
of a Jewish boy born in the year that Adolf 
Hitler came to power in Germany. 

It is a tale of ingenuity and near misses. 
Vilcek’s mother officially converted to Cathol- 
icism in 1939, his father in 1942, allowing 
both to be exempted from the Czechoslova- 
kian government's punitive measures against 
Jews. They placed Jan in a Catholic orphan- 
age at the age of eight, in a further attempt 
to keep him from being deported. Through 
several moves and periods of separation, 
the family survived the Second World War, 
and re-established a comfortable lifestyle in 
Bratislava in 1945. The democratic republic 
of Czechoslovakia had been reinstated after 
years of occupation and division during the 
war, but upheaval returned in 1948 when the 
Communist Party gained political power and 
quickly imposed a totalitarian system. 

Love and Science: A Memoir
JAN VILCEK
Seven Stories: 2016.

Vilcek decided early on to pursue
research. He joined the Institute of Virology
immediately after graduating from medical
school in 1957, and soon published a single-
author paper in Nature (J. Vilcek Nature 187,
73-74; 1960). The young scientist doggedly
established contacts with Western researchers,
and these efforts paid off when he and
Marica defected in 1964. Carrying no
official documents, they were waved
across the border from Austria to West 
Germany — a scene rendered poignant 
in light of the current European refugee 
crisis. The ease with which Vilcek 
procured job offers from US colleagues is 
also hard to imagine now. He joined New 
York University as an assistant professor 
with no interview, a beneficiary of the 
US government's competitive research 
investments during the cold war. 

The final section of the book I found 
the most original. Here, Vilcek reverts 
to the story of Remicade’s success, and 
his unexpected wealth. His first quar- 
terly royalty payment in 1999 was just 
less than his annual salary as a professor; 
by 2005, the portion of his future royal- 
ties pledged to New York University was 
projected to eventually reach more than 
US$105 million. At first, Jan and Marica 
ate at restaurants and caught taxis more 
often; they helped family and friends, and 
replaced their second-hand furniture with 
European art-deco pieces. But they had no 
interest in luxury living, so as the royalties 
grew, they established what would become 
the Vilcek Foundation in 2000. 

They began by supporting their work- 
places, the New York University School of 
Medicine and the Metropolitan Museum 
of Art. They endowed professorships, lab 
space, scholarships and curatorships, earn- 
ing a place among the top 15 US philan- 
thropists of 2005. After the terrorist attacks 
of 11 September 2001, they began to see 
chinks in the US welcome for foreigners. 
Accordingly, in 2006 they launched annual 
Vilcek Prizes to recognize extraordinary 
achievements by immigrants to the United 
States in biomedical science and the arts 
and humanities. Aware that many of 
their prizewinners, from cancer biologist 
Joan Massagué to cellist Yo-Yo Ma, were 
already lauded, the Vilceks established 
annual prizes “for creative promise” in 
2009 to honour up-and-coming talent. 
Prizewinners so far have included the 
Iranian-American scientist Pardis Sabeti, 
for her work on pathogen evolution (see 
N.L. Yozwiak Nature 518, 477-479; 2015). 

This story is told humbly, with honest 
insight into the deliberations between the 
Vilceks and generous credit to the many 
people who have advised them. The cou- 
ple clearly delights in following the careers 
of the awardees. Jan Vilcek is now in his 
early eighties, and he mentions in Love 
and Science how he and Marica have made 
provision for the foundation to continue 
when they can no longer administer it. 
One senses that he is happy with his legacy, 
both scientific and benevolent.


Marian Turner is senior News & Views
editor for Nature. 
Density fluctuations in the Universe (shown in a computer simulation) have a sponge-like topology.


ASTRONOMY 


Topology quest 


Michael Blanton enjoys a history of cosmology focused 
on large-scale structure in the ‘spongy’ Universe. 


Over the past half-century, astrophysicists
and cosmologists have revealed
how structure formed in the Universe.
This has vast importance: we exist only 
because tiny initial fluctuations in the density 
of matter grew through gravity to form gal- 
axies, stars and planets. Key pieces of physics, 
astronomy and mathematics have only in 
recent decades been combined into a scientifi- 
cally coherent and convincing origin story. In 
The Cosmic Web, astrophysicist Richard Gott, 
a compelling teller of cosmological stories, 
describes his part in this scientific quest. 

He begins with a brisk primer on the early 
history of cosmology, starting with the dis- 
covery of the expansion of the Universe by US 
astronomer Edwin Hubble and others in the 
1920s. The second half of the twentieth cen- 
tury, he shows, brought the detection by Arno 
Penzias and Robert Wilson of the cosmic 
microwave background radiation left over 
from the very early Universe (predicted by 
George Gamow and others), and the realiza- 
tion that fluctuations in this radiation would 
reveal the irregularities in density that seeded 
the growth of large-scale cosmic structure. 

This is when Gott’s work enters the picture. 
Beginning with his high-school obsession, 
topology — how geometric shapes can be 
connected — the book comes into focus. We 
start to follow Gott’s fascinating path towards 
a better understanding of large-scale cosmic 
structure. Remarkably, his early discoveries 
led to a paper in American Mathematical
Monthly when Gott was just 20, on a new class
of regular ‘pseudopolyhedrons’ with sponge- 
like topology (J. R. Gott Am. Math. Mon. 74, 
497-504; 1967). These polyhedrons are infi- 
nite, and their surfaces divide space into two 
identically shaped, interlocked regions. As in 
a sponge, each region (the ‘sponge’ and the 
‘holes’ in it) is fully connected. If the Universe
is separated into regions greater than and less 
than the mean density, it shows the same 
sponge-like topology, although with irregu- 
lar patterns. Gott's enduring (and endearing) 
enthusiasm for this connection carries the 
reader through the more technical parts of 
the book. 

The Cosmic Web: Mysterious Architecture of the Universe
J. RICHARD GOTT
Princeton University Press: 2016.

The Cosmic Web is not a complete or representative
intellectual history of twentieth- or
twenty-first-century cosmology. Gott is
rigorous in assigning credit for findings,
but tells the history through the lens of his
collaborations. This could leave a reader
with the impression that a relatively small
group of scientists is responsible for the
discovery of dark matter, the development
of structure-formation theory and progress
on unravelling galaxy formation. Gott’s 
major protagonists (including James Gunn, 
Jeremiah Ostriker and James Peebles) did 
make key discoveries, but in the context of 
a broader group than Gott describes. Gott 
also gives only a partial account of the creation
and development of a key contemporary
effort: work on large-scale structure and map- 
ping of the cosmic web by the Sloan Digital 
Sky Survey (SDSS) — which uses the optical 
telescope at Apache Point Observatory, New 
Mexico — to create the most detailed 3D 
maps of the Universe yet, and continues to 
map further regions and earlier times. 

Gott’s is a personal view of cosmological 
developments, giving space and life to reali- 
ties of research. This evokes the best of phys- 
icist and science historian Abraham Pais’s 
more comprehensive books, such as Inward 
Bound (Clarendon/Oxford Univ. Press, 
1988). Unafraid of spending a page explain- 
ing an idea that turns out to be wrong, Gott 
engagingly describes the blind alleys along 
the way to current views. He presents figures 
and equations stripped to their fundamental 
forms, but avoids the common temptation to 
water them down unrecognizably. 

His work shines with this approach. As 
3D maps of the Universe emerged from the 
1980s onwards, revealed by surveys such 
as the SDSS and its forerunners, and as the 
theoretical picture of cosmology developed,
it became clear that the pattern of density
fluctuations in the Universe is a
sponge-like web. Gott’s youthful
work had primed him to grasp this,
and gave him the
mathematical tools to test it. This aspect of
cosmology is now an essential facet of our 
knowledge, and our fullest understanding 
of the topology of large-scale structure is 
derived from techniques that Gott developed. 
Gott’s journey shows how scientists can be 
so motivated by their earliest obsessions that 
they persist in pursuing them — and how 
unique obsessions can let them bring some- 
thing new to the crowded table of ideas. For 
working scientists, this book is a reminder 
of what drives us, the value of chasing down 
questions that only we would ask, and how 
circuitous that chase can be. The Cosmic Web 
is not just a well-told story about the fron- 
tiers of cosmological knowledge. It is also an 
inspiration to explore them further.


Michael Blanton is an associate professor 
in the Center for Cosmology and Particle 
Physics at New York University, and is 
director of the Sloan Digital Sky Survey IV. 
e-mail: michael.blanton@gmail.com
Books in brief


In a Different Key: The Story of Autism 

John Donvan and Caren Zucker CROWN (2016) 

Hot on the heels of Steve Silberman’s Neurotribes (Avery, 2015; see 
Nature 524, 288-289; 2015) comes another monumental history of 
autism. Journalists John Donvan and Caren Zucker (whose television 
series Echoes of Autism broke ground in the 2000s) trace the halting 
progress in understanding the condition, weaving in stories of, 
among others, Donald Triplett, the first person diagnosed as autistic; 
US medic Leo Kanner; and UK psychiatrist Lorna Wing. The scoop
is historian Herwig Czech’s claim that key player Hans Asperger
contributed to Nazi ‘social cleansing’ of children deemed mentally ill. 


Rebooting India: Realizing a Billion Aspirations 

Nandan Nilekani and Viral Shah ALLEN LANE (2016) 

India’s telecoms network is the world’s second largest, yet the 
country’s rural poor often effectively run marathons in accessing 
services. In this audacious technological manifesto — now in a UK 
edition — entrepreneur Nandan Nilekani and software specialist Viral 
Shah argue that India’s vast challenges in banking, health care and 
other systems are solvable, if approached like government start-ups
run by handpicked entrepreneurs. Nilekani and Shah cite the
e-Aadhaar biometrics identification card, open to all Indians, as a first 
step on the road to digitally streamlined governance and society. 


The Naked Shore: Of the North Sea 

Tom Blass BLOOMSBURY (2016) 

Compared to the charismatic Mediterranean, Europe’s North Sea 
can seem a workaday tract of “mists, miasmas and surliness”, 
notes journalist Tom Blass. But in trawling its depths for this vivid 
travelogue, Blass dredges up nuggets from an eventful cultural, 
military, industrial, economic and ancient history. Launching from 
the estuarine murk of the Thames, he takes in Dutch polders (tracts 
of land dubbed ‘waking’, ‘sleeping’ or ‘dreaming’, depending on 
distance from the sea), Frisian dialects, fishing fracases, puffins, 
porpoises and a future of warming waters and giant wind farms. 


Six Steps Back to the Land: Why We Need Small Mixed Farms and 
Millions More Farmers 

Colin Tudge GREEN (2016) 

The much-chewed-over conundrum of how to feed a projected 
global population of 10 billion demands an agrarian renaissance led 
by scientifically run, small, mixed farms, avers biologist Colin Tudge. 
Using principles such as agro-ecology and a focus on “lots of plants, 
not much meat, and maximum variety”, Tudge’s transformative 
prescription is fresh, pragmatic and packed with cutting-edge 
science. It is impressive, too, for its sound circular-economy thinking, 
not least an emphasis on an informed food culture. 


The Wheel: Inventions and Reinventions 

Richard W. Bulliet COLUMBIA UNIVERSITY PRESS (2016) 

As innovations go, the wheel might seem the ubiquitous driver
of cultural evolution. But Richard Bulliet’s technological history
reveals complexities. The wheel was invented 3 times (with fixed
and moving wheels on axles both emerging 5,000 years ago, and
casters a mere 300); defined differently by the rail and automobile 
industries; and eschewed by civilizations such as the pre-Columbian 
Olmecs. Hitched in turn to rickshaws, ox carts, Mongolian mobile 
homes, barouches and bicycles, this is a deft narrative. Barbara Kiser 
When brain bullets
met crowdfunding 


Stereotaxy — a surgical technique 
that uses a head clamp to pinpoint 
brain areas from coordinates
of external landmarks and
anatomical structures — was first
applied in the nineteenth century,
when it went largely unnoticed
by the medical community. It was
‘reinvented’ 50 years later and is
routinely used today, for example
in deep-brain stimulation for
diseases such as Parkinson’s.

Gaston Contremoulins 
(1869-1950) was the self- 
educated physicist who invented 
the ‘metroradiographic frame’
that marked the birth of 
stereotactic surgery (pictured). 
It was first used in 1897 to guide 
the removal of bullets from the 
brains of two young men who 
had survived after shooting 
themselves in the head. 

This surgical triumph was 
crowdfunded following a public 
appeal by Contremoulins. It 
was reported in the French 
weekly newspaper L’Illustration
and aroused great popular 
interest. In the scientific press, 
it attracted just a single report 
— in the proceedings of the 
French Academy of Sciences, 
Compte-rendus Hebdomadaires 
de l'Académie des Sciences. 
Caroline Apra, Pierre
Bourdillon, Marc Lévêque
Pitié-Salpêtrière Hospital, Paris,
France. 
marc.leveque@aphp.fr 


IPBES disciplinary 
gaps still gaping 


The Intergovernmental Platform
on Biodiversity and Ecosystem
Services (IPBES) convenes this
month to approve summaries
for policymakers of the first
assessments at its fourth plenary.
Despite early calls for IPBES
to draw on a broader range
of disciplines than did the
Intergovernmental Panel on
Climate Change (see E. Turnhout
et al. Nature 488, 454-455;
2012), the social sciences and
the humanities remain markedly
under-represented. They make up
less than 10% of the membership
of IPBES expert groups, instead
of the recommended 30%. These
disciplines should play a bigger
part in IPBES assessments and
in implementing the first IPBES
work programme for 2014-18.

The imbalance mirrors 
institutional and knowledge 
barriers between research 
disciplines. The IPBES Secretariat 
and its Multidisciplinary Expert 
Panel need to consult more 
experts from the social sciences 
and humanities for nominations 
for assessments. The panel should 
encourage these stakeholders to 
engage in scoping and reviewing 
activities and to register on IPBES 
networking sites. 

One of the IPBES objectives is 
to include experts with “balance 
in the terrestrial and marine 
natural sciences, social and 
economic sciences, and arts and 
humanities”. The secretariat 
should review the disciplinary 
balance of all IPBES activities 
and products, and make the 
findings publicly available. 

Alice B. M. Vadrot University of 
Cambridge, UK. 

Jens Jetzkowitz Philipps 
University Marburg, Germany. 
Lindsay C. Stringer University of 
Leeds, UK. 

l.stringer@leeds.ac.uk

More to fisheries
than catch limits 


You suggest that the European 
Union is setting a worrying trend 
by ignoring scientific advice on 
overfishing, but that is only part of 
the story (Nature 528, 435; 2015). 
The advice you mention refers 
to catch levels that are in line with 
the objectives of the EU Common 
Fisheries Policy. Those are based 
on the fish-stock biomass that 
can deliver maximum sustainable 
yields. Scientific advice on the 
annual catch limits (known as 
total allowable catches, or TACs) 
is aimed at delivering maximum 
sustainable yield and is provided 
by the International Council for 
the Exploration of the Sea (ICES). 
However, the TACs are set by the 
EU Council of Ministers and, as 
you point out, can sometimes 
exceed the levels advised by ICES. 
Nevertheless, to accuse the 
EU of “ignoring” scientific advice 
in such cases oversimplifies 
the decision-making process. 
In fact, the extent to which 
TACs have exceeded the levels 
recommended by ICES has 
decreased since 2001, as you note. 
Other objectives in the fisheries 
policy relate to their economic 
and social benefits. They aim to 
ensure the availability of food 
supplies and to contribute to a fair 
living standard for those who are
dependent on fishing. Decision-making
for fisheries management
is therefore more complex than 
simply setting catch limits. 

John Casey, Jann T. 
Martinsohn, Hendrik Dörner
European Commission — JRC 
Institute for the Protection and 
Security of the Citizen, Ispra, Italy. 
john.casey@jrc.ec.europa.eu 


Metrics needed to 
track adaptation 


A lesser-known success of the 
Paris climate agreement in 2015 
is the ‘global adaptation goal’,
an ambitious plan for adapting
to climate change that reaches 
beyond national boundaries. 
This is important because 
climate-change mitigation needs 
to take the world’s adaptation 
potential into account. 

To sustain the long-term goal 
of keeping the global average 
temperature rise well below 
2°C, we also need evidence 
that the world can adapt to 
the impacts of warming. The 
Paris agreement aims to build 
a collective understanding of 
adaptation through metrics and 
tools that capture each country’s 
efforts. Aggregating national 
contributions as a global trend 
will indicate whether humankind 
is on track to adapt. 

The first step will be the 
agreement’s ratification in April
by at least 55 parties to the United 
Nations Framework Convention 
on Climate Change (UNFCCC) 
— which together account for at 
least 55% of total greenhouse-gas 
emissions. 
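
The ratification rule described above combines two conditions that must both hold. As a minimal sketch (the party names and emission shares are made-up placeholders, not real data, and the function name `threshold_met` is an illustrative assumption), the check could be written as:

```python
from typing import Iterable, Tuple

MIN_PARTIES = 55            # at least 55 ratifying parties
MIN_EMISSIONS_SHARE = 55.0  # together at least 55% of total emissions


def threshold_met(ratifiers: Iterable[Tuple[str, float]]) -> bool:
    """Return True when both conditions hold for the ratifying parties.

    Each entry is (party name, share of total greenhouse-gas emissions in %).
    """
    ratifiers = list(ratifiers)
    total_share = sum(share for _, share in ratifiers)
    return len(ratifiers) >= MIN_PARTIES and total_share >= MIN_EMISSIONS_SHARE


# Hypothetical inputs for illustration only.
many_small = [(f"small_{i}", 0.5) for i in range(60)]  # 60 parties, 30% in total
few_large = [(f"large_{i}", 6.0) for i in range(10)]   # 10 parties, 60% in total

print(threshold_met(many_small))              # enough parties, too little share
print(threshold_met(few_large))               # enough share, too few parties
print(threshold_met(many_small + few_large))  # both conditions satisfied
```

Neither many small emitters nor a handful of large ones is sufficient alone; only the combined list passes both tests.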

It will next be necessary to 
define metrics that both reflect 
national circumstances and 
allow aggregation. To this end, 
scientists tracking adaptation 
will need to work with experts at 
the UNFCCC and organizations 
such as the UN Environment
Programme. 

Alexandre K. Magnan Institute 
for Sustainable Development and 
International Relations, Paris, 
France. 
alexandre.magnan@iddri.org

SCHIZOPHRENIA


From genetics to physiology at last 


The identification of a set of genetic variations that are strongly associated with the risk of developing schizophrenia 
provides insights into the neurobiology of this destructive disease. SEE ARTICLE P.177 


RYAN S. DHINDSA & DAVID B. GOLDSTEIN 


Schizophrenia is a devastating and chronic
neuropsychiatric disorder that affects
nearly 1% of the world’s population. It
has long been hoped that the identification of
genetic risk factors for schizophrenia would
help to identify the physiological causes of
the disease, but, despite decades of intensive
research, the biological underpinnings of schizophrenia
have remained elusive. On page 177
of this issue, Sekar et al. present a remarkable
genomic and neurobiological study that finally
delivers on this long-standing hope.

There are two typical paths to drug discovery. 
The first is serendipity — sometimes research- 
ers simply get lucky and find compounds that 
happen to work. Although such fortunate acci- 
dents have led to the development of some of 
medicine's most important drugs, this method 
of discovery cannot be systematized, and the 
resulting treatments often fall well short of a 
cure. Current treatments for schizophrenia 
fall uniformly into this category. The second, 
preferable, path is to understand the mecha- 
nisms that underpin a disease, and to design 
treatments that target those underlying causes. 

One of the main problems in deciphering 
the molecular basis of schizophrenia is that 
there is a near complete absence of clearly 
associated biological changes. However, schiz- 
ophrenia is a strongly genetic disorder, and for 
decades many have believed that understand- 
ing the genetics involved might provide a way 
to begin to dissect the biology of the disease. 
Until now, following this line of reasoning has 
been a largely frustrating experience. Before 
the advent of genome-wide association stud- 
ies (GWAS), which link disease risk to specific 
regions of the genome, the field was plagued 
by a lack of statistical rigour and false starts, 
as researchers reported hundreds of putative 
associations between genetic variations and 
the risk of schizophrenia. But GWAS have
demonstrated that these early associations
were largely incorrect.

Careful GWAS have now finally identified
real associations between genetic variants
and schizophrenia. In particular, a 2014 study
that used a huge sample size and strict statistical
methods revealed that there are at least
108 genetic regions (loci) associated with a risk
of schizophrenia. However, finding a locus is
not the same as identifying a causal gene:
for instance, loci are often located in sequences
that don’t encode genes, and many different
variations can occur in one region, making it
hard to pinpoint what exactly is driving the risk
signal. Indeed, these 108 risk factors were not
traced to specific genes or variant sequences.
Without that information, a clear insight
into the molecular aetiology of the disease
remained lacking, but Sekar et al. have now
taken that crucial next step.

Figure 1 | A complex association. Variations in numerous regions of the genome have been associated
with an increased risk of schizophrenia, with many of the strongest associations occurring in the major
histocompatibility complex (MHC). This region is challenging to study because it is 3.6 million bases
long and complex, containing three classes of gene that have differing roles in immunity. Genetic loci
that are associated with risk are found along the entire region (only three of the strongest associations are
shown here, for simplicity). However, genetic variants in C4A and C4B are most strongly associated with
increased risk. Sekar et al. overcame the challenges of studying the MHC to pinpoint how variations in
the C4 genes that alter gene expression can increase risk.

The strongest risk association for schizo- 
phrenia is found in the major histocom- 
patibility complex (MHC) — a region on 
chromosome 6 that contains genes involved in 
acquired immunity (Fig. 1). Within the MHC, 
the strongest risks are associated with loci near 
the gene C4, which encodes a complement 
factor (a part of the innate immune system). 
This association motivated the authors to ask 
whether variations in C4 might be involved in 
schizophrenia risk. 

In humans, C4 exists as two distinct genes, 
C4A and C4B, and the number of copies of 
each gene varies from person to person. To 
complicate things further, there are long and 
short versions of C4A and C4B, which are 
determined by the presence or absence of a 
human endogenous retroviral (HERV) inser- 
tion in a non-coding region of the gene. Sekar 
et al. developed a way to accurately assess
the number of copies of each gene, and then
used RNA-expression data to relate gene copy 
number and HERV status to C4A and C4B 
gene-expression levels. 

With these data in hand, the authors present 
compelling evidence that they have identified 
the underlying cause of one of the main MHC 
risk associations. They have found that vari- 
ation at C4 is associated with schizophrenia 
risk independently of other genetic variations 
in the MHC region. Specifically, different lev- 
els of risk are associated with different genetic 
combinations of C4 copy number and HERV 
status. The authors used RNA data to show 
that expression of C4A and C4B increases 
with copy number, and that the presence of 
the HERV insertion increases the ratio of 
C4A to C4B expression. Moreover, the higher 
the levels of C4A expression, the greater the 
risk of schizophrenia. 

It is worth emphasizing that defining causal 
variants in the MHC regions is one of the most 
challenging problems in human statistical 
genetics. The MHC is 3.6 million bases long, 
and variants associated with schizophrenia 
risk have been found across the entire region. 
Indeed, the region is so challenging that 
geneticists often joke that they hope to find 
associations anywhere except in the MHC. 
Caution is imperative when trying to pin- 
point causation in such a genetically complex 
region. However, Sekar and colleagues seem
to have successfully overcome these challenges.

The complement pathway has been implicated
in synaptic pruning, a developmental
process in which the synaptic connections 
between neurons are continuously elimi- 
nated in the brain until early adulthood. 
Using a mouse model, Sekar et al. found that 
C4 expression is upregulated during peri- 
ods of synaptic pruning. By contrast, mice 
deficient in C4 showed signs of decreased 
pruning. Thus, the authors postulate that 
increased C4A expression in individuals with 
schizophrenia results in increased synaptic 
pruning. Interestingly, studies of the brains 
of humans with schizophrenia have shown 
that affected individuals exhibit thinning 
and reduced synaptic structures in the corti- 
cal region of the brain compared with people 
without the disorder. Hyperactive synaptic
pruning might explain these findings.

Unfortunately, because mice lack the two
forms of C4 found in humans, the question
of why schizophrenia risk depends specifically
on C4A expression levels remains open.
Answering this question is now a priority, and
can be expected to provide further mechanistic
information.

Sekar and colleagues’ study finally gives us
the first real inroad into the molecular aetiology
of schizophrenia, and perhaps a direction
for the development of treatments. Although
pruning undoubtedly represents a challenging
therapeutic target, the authors’ beautiful
and comprehensive study gives much-needed
inspiration for all those researchers who are
trying to leverage genetics to advance our
understanding of the biology of neuropsychiatric
diseases.


CLIMATE SCIENCE


A great Arctic ice shelf


Newly mapped features on the floor of the Arctic Ocean suggest that the Arctic 
basin was once covered by a one-kilometre-thick, flowing ice shelf derived from 
large ice sheets in eastern Siberia, Arctic Canada and the Barents Sea. 


EUGENE DOMACK 


Ice cover across the Arctic Ocean has
commonly been viewed as floating sea ice,
formed as a relatively thin veneer from the
freezing of seawater. Much effort has been
directed towards understanding the behaviour
of this form of ice cover and its recent
precipitous decline. But there has been a long-standing
debate about a different kind of ice:
ice of a thicker, more terrestrial origin, that
might have existed across the Arctic basin in
the distant past. Writing in Nature Communications,
Jakobsson et al. bring this ancient ice
shelf to life from a set of remarkable profiles
of elevated ridges that stand above the deep
basins of the Arctic Ocean.

Early images of portions of the Arctic sea
floor revealed parallel grooves, scours and 
rumples — the telltale signs of ice having 
contact with and ploughing the sea floor. But 
the small number of features and their large 
geographic separation suggested to most 
scientists that floating and drifting icebergs 
were to blame. Jakobsson and colleagues
demonstrate that the ploughing pattern 
observed on widely separated ridges reflects a 
consistent and uniform direction of ice move- 
ment across the entire basin, as others have 
also reported.
This key set of observations can be explained
only by the presence of a single, widespread ice 
shelf (Fig. 1), one that flowed freely between 
large ridges, but became grounded and left 
its trail across the shallower regions of the 
Arctic basin. The ice shelf — estimated at 
some 4.1 million square kilometres on the 
basis of the numbers given in Jakobsson and 
colleagues’ paper — was four times the area 
of all the ice shelves now found in Antarctica, 
including the immense Ross and Ronne-Filch- 
ner ice shelves. It must have been about 1 kilo- 
metre thick, allowing it to float over the great 
depths of the Amerasian and Eurasian basins, 
but touching down and ploughing across the 
shallower regions. 

By tracing the glacial furrows towards their 
points of origin, Jakobsson and co-workers 
conclude that land-based ice sheets must have 
fed the ice shelf from three separate centres: 
the eastern Siberian lowlands; the Scandina- 
vian regions of the Barents Sea; and high Arctic 
islands in north central Canada (Fig. 1). Furthermore,
the authors found evidence (in the
form of microfossils and of magnetic properties
in sedimentary cores collected from some
of the ridges) that this Great Arctic Ice Shelf
(GAIS) existed not during the most recent
glaciation (the Wisconsinan, about 22,000 to
18,000 years ago), but instead about 160,000
to 140,000 years ago, during the penultimate
glaciation.

Figure 1 | Sketch of an ancient ice shelf across the Arctic Ocean. Jakobsson et al. report evidence of a one-kilometre-thick ice shelf (grey region) that covered the
entire Arctic basin during the penultimate glaciation. Blue arrows indicate feeding ice streams from the Canadian Arctic islands, Siberia and Svalbard; grey arrows
represent flow lines of the ice shelf. Modern locations of Greenland, Svalbard, Alaska and Siberia are shown in pale blue; green asterisk indicates the location of the
New Siberian Islands. The rough locations of the Amerasian and Eurasian basins beneath the Arctic Ocean are also indicated. (Figure adapted from ref. 3.)

The antiquity of the GAIS helps to explain 
why some evidence is missing for the ice sheets 
that fed it. The regions from which the ice was 
derived are now shallow, broad shelves and 
coastal plains of Canada, Russia and north- 
western Europe. During the Wisconsinan and 
the subsequent sea-level rise, these regions 
were exposed to erosion and sedimentation, 
which may have masked the direct footprint of 
the ice sheets. Even so, some regions that are
clearly in the path of the ice sheets that covered
eastern Russia have yielded no evidence of glaciation
at all. These areas include the remote
New Siberian Islands, which may not have yet
undergone complete geological examination.

An ice shelf of the extent suggested by 
Jakobsson and colleagues’ work is very differ- 
ent from the expansive sea ice that covers the 
Arctic Ocean today. Sea ice constantly opens 
and closes, providing gaps for light; this allows 
photosynthesis to occur and thus to fuel the 
vast Arctic marine ecosystem. Just as scientists 
contemplate the effect of more open water 
as Arctic sea ice is progressively reduced in 
response to climate change, an equally daunt- 
ing task is to imagine how the ecosystem 
could function under the continual darkness 
and cover that the GAIS would have caused. 
A more limiting scenario for life could not be 
imagined, with the exception of the subglacial 
lakes that lie beneath the Antarctic ice sheet. 

Equally important is the role that the GAIS 
must have had in regulating ocean chemistry 
while not greatly changing sea level. The impli- 
cations of the changes in ocean chemistry for 
estimates of ancient sea levels can be appreci- 
ated by considering how climate scientists con- 
ventionally reconstruct sea-level oscillations. 
To reconstruct ancient glaciations largely lack- 
ing geological features that directly indicate 
shorelines, scientists have used the geochemi- 
cal history of the ocean recorded in the shells 
of marine plankton called foraminifera. Glacial 
oceans are enriched in heavy-oxygen atoms 
(¹⁸O), and so the change in chemistry over
time — measured as the ratio of light-oxygen 
atoms to heavy oxygen in foraminifera shells 
— is used to gauge sea-level changes caused 
by the comings and goings of large ice sheets.
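
To get a feel for why a floating ice shelf matters for this method, the rough sketch below converts the GAIS dimensions quoted earlier (about 4.1 million square kilometres in area, about 1 kilometre thick) into the sea-level change that an isotope-based estimate would attribute to the same ice if it were assumed to sit on land. The uniform thickness, the ice and water densities, the modern ocean area and the function names are simplifying assumptions introduced here, not values taken from Jakobsson and colleagues’ paper.

```python
SHELF_AREA_KM2 = 4.1e6      # ice-shelf area quoted in the text
SHELF_THICKNESS_KM = 1.0    # approximate thickness quoted in the text

# Assumptions made for this sketch only.
OCEAN_AREA_KM2 = 3.61e8     # approximate modern global ocean area
ICE_DENSITY = 917.0         # kg per cubic metre
WATER_DENSITY = 1000.0      # kg per cubic metre (freshwater equivalent)


def ice_volume_km3(area_km2: float, thickness_km: float) -> float:
    """Volume of a slab of ice with uniform thickness."""
    return area_km2 * thickness_km


def apparent_sea_level_fall_m(volume_km3: float) -> float:
    """Sea-level fall implied if the ice were grounded rather than floating."""
    water_equivalent_km3 = volume_km3 * ICE_DENSITY / WATER_DENSITY
    return water_equivalent_km3 / OCEAN_AREA_KM2 * 1000.0  # km to m


volume = ice_volume_km3(SHELF_AREA_KM2, SHELF_THICKNESS_KM)
print(f"ice volume: {volume:.2e} cubic km")
print(f"apparent sea-level fall: {apparent_sea_level_fall_m(volume):.1f} m")
```

Under these assumptions the result is of the order of 10 metres. Isotopically light water locked in a floating shelf would be counted as a sea-level fall of that size, even though floating ice displaces its own weight of sea water and so barely changes sea level.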

This method has been validated for the most 
recent glacial cycle, because direct measures 
of sea level are an exceedingly good match 
to estimates based on ice-sheet volume and 
changes in ocean chemistry. But estimates for 
sea-level change during the penultimate gla- 
ciation have always been problematic, because 
the few direct shoreline records of sea level are 
higher than estimates based on the oxygen- 
isotope record corresponding to that period.
Jakobsson and co-workers’ reconstructed
GAIS seems to explain this mismatch, and
reminds climate scientists that they should
check their assumptions as they continue to
account for glacial ice and its effects on sea level
during the numerous glacial episodes of the
past 2.5 million years.


Out with the old


The selective elimination of cells that have adopted an irreversible, senescent 
state has now been shown to extend the lifespan of mice and to ameliorate some 
age-related disease processes. SEE ARTICLE P.184 


JESUS GIL & DOMINIC J. WITHERS 


The ability to fight the ageing process
has been a long-held human desire.
Although this quest often seems to
be driven by vanity, ageing is the major risk
factor for many of the diseases that plague
modern society. More than 50 years ago, it
was suggested that ageing is linked to a state
of arrested cell growth known as senescence,
but this link has remained unproven, and the
molecular basis for organismal ageing has been
elusive. In this issue, Baker et al. (page 184)
demonstrate that the removal of senescent cells
does indeed delay ageing and increase healthy
lifespan (healthspan).

Senescence is a cellular state in which
cells permanently stop dividing. It is mediated
by two signalling pathways: the
p53 pathway and the p16Ink4a-Rb pathway.
Senescent cells secrete a complex cocktail 
of factors called the senescence-associated 
secretory phenotype (SASP), which includes 
matrix metalloproteinases (enzymes that 
break down the extracellular matrix) and pro- 
inflammatory signalling molecules. Such cells 
have been shown to accumulate during ageing, 
and their presence has been associated with a 
broad range of diseases, including diabetes, 
kidney disease and many cancers.

The group that performed the current study
previously showed that removing senescent
cells from a mouse model of accelerated ageing
delays the onset of several disease-related
processes. However, the relevance of these
observations to the normal ageing process was
unclear. Baker et al. have now directly tackled
this uncertainty using a genetically engineered
mouse model that they had developed
previously, called INK-ATTAC. These mice
produce a caspase enzyme specifically in cells
that express the p16Ink4a gene. The caspase can be
activated by the injection of a drug; the activated
caspase then triggers cell death, eliminating
senescent cells in which it is expressed.

Figure 1 | Improving healthspan. Senescent cells, which are in a state of irreversible growth arrest,
accumulate in various organs during ageing and are associated with age-related diseases in many tissues.
Baker et al. selectively eliminated senescent cells in ageing mice. This increased healthy lifespan,
reducing many age-related, disease-associated abnormalities.

Baker and colleagues found that the elimination
of p16Ink4a-expressing cells increased
lifespan, regardless of the sex or strain of
mouse examined, and ameliorated a range of
age-dependent, disease-related abnormalities,
including kidney dysfunction and abnormalities
in heart and fat tissue (Fig. 1). The authors
observed increased activity and exploratory
behaviour and a decrease in the incidence of
cataracts (although this improvement was
strain-dependent). Senescent-cell removal also
delayed the onset of cancer, without affecting
the range of observed tumour types. Together,
these findings suggest that the accumulation of
p16Ink4a-expressing cells during normal ageing
shortens healthspan.

The INK-ATTAC mouse is a powerful 
model with which to investigate the physi- 
ological relevance of senescence, but it is not 
without limitations. For instance, the model
is assumed to selectively eliminate senescent
cells; although not all p16Ink4a-expressing
cells are necessarily senescent, the
ATTAC transgene that produces the caspase
seems to be expressed only in senescent cells.
However, it could be that drug treatment kills
only ‘late senescence’ cells, which express
high levels of p16Ink4a and ATTAC, rather than
triggering a more general elimination of senescence.
Moreover, drug treatment does not kill
some senescent cells, including immune cells 
called lymphocytes as well as liver and colon 
cells, which limits the reach of the model. An 
improved characterization of the cell types 
that are eliminated is needed to fully under- 
stand the basis of the extended healthspan of 
these animals. 

Another caveat is that the inducible 
elimination of senescent cells requires twice- 
weekly, long-term injections into the abdo- 
men. Males that were injected with a control 
solution rather than the drug typically had 
shorter lifespans than normal mice, perhaps 
because of this intensive treatment regime. 
More-sophisticated model animals, in which 
senescent cells can be ablated in different 
tissues at different times and without the need 
for repeated injections, would help to extend 
the current findings. 

Although the ablation of senescent cells 
ameliorates some age-related defects, it has no 
effect on others, including declines in motor 
performance, muscle strength and memory. 
This could reflect limitations of the ATTAC 
model. However, it might also suggest that 
senescent cells are involved in the progression 
of only some diseases. 

Why might eliminating just the few cells
that are senescent have beneficial effects
in a range of tissues? Baker and colleagues’
analysis of the kidney might help to explain 
this observation and clarify why senescent 
cells can be so disruptive during ageing. A 
striking disease-associated change often arises 
in aged kidneys, in capillary networks called 
glomeruli. However, the authors observed 
senescence primarily in another cell type, 
the epithelial cells of the kidney tubules. This 
suggests that SASP components secreted by 
epithelial cells could be responsible for disease 
in the glomeruli. 

A search for compounds that can selectively
eliminate senescent cells is under way, and
could be an important step in translating the
findings of Baker and colleagues’ study to
combating diseases of ageing in humans. An
alternative therapeutic approach could be to
repress the SASP. Indeed, inhibition of JAK
proteins, which mediate the actions of some
cytokines (a type of signalling molecule),
reduces the SASP and alleviates frailty in old
mice. Rapamycin, a drug that is used as an
immunosuppressant in humans, also robustly
extends mouse lifespan and regulates the
SASP. Thus, common therapeutic mechanisms
acting on the SASP might underlie the
beneficial effects of both rapamycin and senescent-cell
ablation on lifespan and healthspan.

It is worth noting that senescence is a protective
response that limits tissue scarring
(fibrosis) and cancer. Cells that express senescence
markers are also involved in wound
healing. Interestingly, the current study suggests
that, although ablating senescent cells
impairs wound healing, in general it has
limited negative effects, and the authors found
no evidence for increased fibrosis or cancer
development. Nonetheless, any future senescence-based
therapies must take care to control
for possible detrimental consequences.


Compact coupling for a
two-stage accelerator 


Injecting electrons that have been accelerated by a laser-powered plasma wave into
a second plasma wave represents a two-step electron accelerator. With 100 such 
steps, collider applications become possible. SEE LETTER P.190 


BRIGITTE CROS 


Laser-plasma acceleration is a means
of accelerating electrons in a plasma
(a fluid of positively and negatively
charged particles) using electric fields that
are 1,000 times stronger than those of conventional
particle accelerators operating in a
vacuum. However, despite major advances in
the past few years, reliable laser-plasma accelerators
have still not been built. On page 190
of this issue, Steinke et al. describe a way in
which the coupling of electron beams can
be controlled with plasmas. The authors’
achievement could lead to the development
of laser-driven electron accelerators that can
be scaled up to achieve energy gains on a par
with those of today’s particle accelerators. Such
electron accelerators would fit on a tabletop,
and could replace conventional machines at
medical facilities, for instance.

Plasma waves — collective oscillations of 
electrons on a background of heavier, sta- 
tionary ions — are intrinsic characteristics 
of plasmas. The oscillating electrons lead to a 
charge separation in the plasma wave that is 
associated with an electric field of up to several 
hundred gigavolts per metre. Plasma waves can 
be generated by the application of an intense, 
ultrashort laser pulse (Fig. 1), which acts like
a snow plough to expel electrons from the most 
intense regions of the plasma. 

Depending on the laser intensity, plasma 
waves exist in either the quasilinear or the 
nonlinear regime (according to the wave equa- 
tions that describe them). In the quasilinear 
case, plasma waves provide a periodic accelerating
structure for injected particle beams
(Fig. 1). Alternatively, nonlinear interactions
between the laser pulse and the plasma electrons
during laser propagation cause the
plasma waves to form spherical cavities that
can trap and accelerate a fraction of the plasma
electrons. For example, electrons in plasmas
are typically accelerated to energies in the
100-megaelectronvolt range across millimetre
distances, for laser systems that have a
maximum power of 30 terawatts (1 terawatt is
10¹² W). With the development of petawatt-class
(10¹⁵ W) laser systems, acceleration in
the nonlinear regime, extended to several
centimetres by laser guiding, has generated
multi-gigaelectronvolt electron bunches. (For
comparison, the Stanford Linear Accelerator
Center in Menlo Park, California, produces
electrons of 50 GeV over 3 km, whereas a
plasma can achieve 5 GeV in about 10 cm.)
To further increase the accelerated electron 
energy, a large electric field must be main- 
tained over the longest possible distance (up to 
metres for the quasilinear regime). Lowering 
the density of the electrons in the plasma helps 
to reduce nonlinear effects and to increase the 
acceleration distance, provided that the laser 
intensity is maintained at the required level 
over the whole distance. This can be achieved 
by guiding the laser beam externally, using 
either a preformed plasma structure or a gas- 
filled capillary tube. However, the interaction 
between the laser and the plasma causes a trans- 
fer of the laser beam’s energy to the plasma, 
leading to ‘laser depletion’. This problem, and 
the limitation on acceleration distance, can 
be resolved by using several plasma stages in
sequence. Supplying a fresh laser beam for each
subsequent accelerating stage makes the pro- 
cess scalable in electron energy and repetition 
rate, with a reasonable amount (10-100 joules) 
of laser energy per beam. 
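
The figures quoted above can be turned into average accelerating gradients, which makes the case for staging easier to see. The sketch below uses only the energies and lengths given in the text for the two gradients; the function names, and the per-stage gain and target energy in the staging example, are illustrative assumptions rather than design values from Steinke and colleagues.

```python
import math


def gradient_gev_per_m(energy_gain_gev: float, length_m: float) -> float:
    """Average accelerating gradient over a given length."""
    return energy_gain_gev / length_m


def stages_needed(target_gev: float, gain_per_stage_gev: float) -> int:
    """Number of identical stages needed to reach a target energy."""
    return math.ceil(target_gev / gain_per_stage_gev)


# Values quoted in the text.
conventional = gradient_gev_per_m(50.0, 3000.0)  # 50 GeV over 3 km
plasma = gradient_gev_per_m(5.0, 0.10)           # 5 GeV in about 10 cm

print(f"conventional: {conventional * 1000:.1f} MeV per metre")
print(f"plasma:       {plasma:.0f} GeV per metre")
print(f"ratio:        {plasma / conventional:.0f}")

# Hypothetical staging example: 5 GeV gained per stage, 500 GeV target.
print(f"stages: {stages_needed(500.0, 5.0)}")
```

The ratio of the two average gradients comes out in the thousands, the same order as the field-strength advantage described for plasmas. The staging function simply divides a target energy by the gain per stage, ignoring the losses between stages that any real chain would have to account for.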

Steinke et al. have combined an ensemble 
of state-of-the-art plasma designs to achieve 
what could potentially be developed into an 
all-plasma compact accelerator driven by 
lasers. For the first stage, they use the nonlinear 
regime to create the electron beam from a gas 
jet; the nonlinear regime is necessary because, 
in the quasilinear regime, no electrons from 
the plasma itself can be trapped and acceler- 
ated. The authors use the quasilinear regime 
for the second stage, at which the accelerated 
electrons from the first stage, having energies 
of 100 MeV, are injected. Notably, the authors’ 
components for coupling the injected electrons 
with the plasma wave and for focusing the laser 
have been reduced from the metre scale to the 
centimetre scale, with their entire set-up being 
on a scale of 10 cm.

The novelty of their scheme relies on the use 
of plasmas for every key element of the accel- 
erator: the electron source is produced when a 
laser pulse hits a jet of gas, which becomes ion- 
ized as a plasma and leads to an electron beam 
of short duration (typically 10 femtoseconds; 
1 femtosecond is 10°’ s); the electron beam is 
focused by a plasma lens that is produced bya 
column of plasma created by discharge inside a 
capillary (the magnetic field of the plasma lens 
symmetrically confines the electrons); and the 
electron beam is accelerated in a plasma wave 
that is driven by a pulsed laser beam focused 


Figure 1 | Structure of a plasma wave. An initial laser pulse propagating through a plasma displaces 
electrons (not shown) in the plasma. These electrons can generate a plasma wave in the ‘quasilinear’ 
regime, in which the wave has oscillating electric fields (longitudinal component shown by solid black 
arrows, transverse component by dotted black arrows). Injected electrons in tune with the accelerating 
phase (blue; decelerating phases in red) of the longitudinal wave and with the focusing fields pointing 
away from the horizontal axis — that is, within the black oblong — will be focused and accelerated. 
Steinke and co-authors show that such a plasma wave can accelerate an electron beam produced from a
first-stage plasma accelerator. In principle, using more than two stages will further scale the energy gain. 
with a plasma mirror (a mirror that reflects
light above a threshold intensity at which the 
surface of the material is ionized). In contrast 
to previous experiments, two independent 
laser beams are used, driving two separate 
accelerating plasma stages. The consequence 
of using plasmas for each element is to 
maintain the overall compactness of the beam- 
line, thus preserving the global accelerating 
electric field. 

Although this experiment is a remarkable 
achievement, room for improvement remains. 
The electron beam was externally injected 
into the plasma wave and accelerated with 
a maximum energy gain of 100 MeV and a 
3.5% charge-coupling efficiency (the fraction 
of charge at the entrance of the second stage 
that will be accelerated at the exit). Steinke and 
colleagues present simulations showing that the 
trapping efficiency of electrons in the plasma 
wave might be increased to 90% by, for exam- 
ple, injecting an electron beam with optimized 
proportions specifically into the accelerating 
and focusing regions of the plasma wave. This 
could be achieved by using an electron beam 
(in which the spread of the electrons’ energy 
is smaller than that in Steinke and colleagues’ 
experiments) that is focused on a spot of 
typically less than 10 μm, and coupling this with
a plasma wave that is larger than the electron 
beam and that is excited by a laser with a uni- 
form amplitude (guided in a way that prevents 
oscillations). An energy gain in the 10-GeV 
range would be expected for metre-scale 
plasmas, although plasma waveguides of this 
size are not currently available. In addition 
to the quality of the coupling of the electron 
bunch as it enters the plasma, its stability in the 
plasma wave must be controlled precisely in 
space and time as it is accelerated, to achieve a 
high-quality electron beam. 
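
To see why the coupling efficiency matters so much for staging, the toy calculation below multiplies a per-stage coupling efficiency across successive hand-offs. It is only a sketch: the function name `staged_beam`, the assumption that every stage adds the same energy, and the assumption of an identical, independent coupling efficiency at every hand-off are illustrative choices, not part of Steinke and colleagues' analysis.

```python
def staged_beam(n_stages, gain_per_stage_mev, coupling_efficiency):
    """Toy model of a multistage plasma accelerator (illustrative assumptions only).

    Assumes every stage adds the same energy and every hand-off between
    consecutive stages keeps the same fraction of the charge.
    """
    if n_stages < 1:
        raise ValueError("need at least one stage")
    energy_mev = n_stages * gain_per_stage_mev
    # There are n_stages - 1 hand-offs, each keeping `coupling_efficiency` of the charge.
    surviving_fraction = coupling_efficiency ** (n_stages - 1)
    return energy_mev, surviving_fraction


# Two stages of about 100 MeV each, with the measured 3.5% coupling.
print(staged_beam(2, 100.0, 0.035))

# A hypothetical 100-stage machine built from 10-GeV stages, at the measured
# (3.5%) and the simulated (90%) coupling efficiencies.
for efficiency in (0.035, 0.90):
    energy, fraction = staged_beam(100, 10_000.0, efficiency)
    print(f"coupling {efficiency:.3f}: {energy / 1e6:.1f} TeV, "
          f"surviving charge fraction {fraction:.3g}")
```

Under these assumptions, even a 90% coupling efficiency leaves only a small fraction of the initial charge after 100 stages, which suggests why both the coupling quality and the beam stability need to be controlled so precisely.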

Although there is a long way to go before a 
laser-plasma accelerator with 100 stages can be 
built for collider applications, intermediate- 
energy accelerators in the 10-GeV range are 
a realistic possibility in the near future, as test 
facilities for accelerator physics and other appli- 
cations. Numerous research and development 
activities are under way worldwide, including 
collective efforts in Europe to design an inter-
mediate-energy laser-plasma accelerator. Such
activities hold the promise of shrinking the size 
and cost of future machines.
TOMMY TRENCHARD/EUROPEAN MOBILE LABS


Sequencing 
on the ground 


The West African Ebola epidemic that began 
in 2014, and still threatens to re-emerge, 
prompted a flurry of efforts to sequence 

the genome of the virus as it spread. 
Genomic information can contribute to 
epidemiological tracking, identify mutations 
that increase the disease’s virulence and 
inform therapeutic strategies. But sequences 
must be obtained fast if they are to help to 
control a raging epidemic. 

On page 228 of this issue, Quick et al. 
describe a portable, real-time sequencing 
system that they used in local, resource-poor 
sites in Guinea during 2015 (J. Quick et al.
Nature 530, 228-232; 2016). The requisite
equipment, including the tiny MinION
genome sequencer (pictured), fitted into 
50 kilograms of standard airline luggage. The 
researchers generated sequence data within 
24 hours of receiving a patient’s viral RNA 
sample — days to weeks less than it would 
have taken to receive data from a central 
sequencing facility. Marian Turner 
Asymmetric breaks
in DNA cause sterility 


The part that the mouse gene Prdm9 plays in generating double-strand breaks in 
DNA has now been linked to its putative role in speciation, thanks to experiments 
that use a ‘humanized’ version of the gene. SEE ARTICLE P.171 


JIRI FOREJT 


During the specialized meiotic cell
divisions that produce mammalian
eggs and sperm, an individual’s two
sets of chromosomes are shuffled in a pro- 
cess called meiotic recombination, producing 
genetically unique chromosomes to pass on 
to offspring. Recombination tends to occur 
in DNA regions known as hotspots, the loca- 
tions of which are determined by a single 
DNA-binding protein, PRDM9. The presence 
of different versions (alleles) of the Prdm9 gene 
in two subspecies of mice renders the hybrid 
offspring of these subspecies sterile, but how
the gene exerts this effect has been unknown.
On page 171 of this issue, Davies et al. link
these two roles of Prdm9. They find that it is 
the rapid subspecies-specific degradation 


of Prdm9 DNA-binding sites that underlies 
sterility in mouse hybrids. 

The male offspring of crosses between the 
mouse subspecies Mus musculus musculus 
and Mus musculus domesticus are sterile. It 
has been proposed that sterility arises for two 
reasons. First, the chromosomes from the 
two subspecies do not recognize one another 
normally because their DNA sequences have 
diverged, so they do not align correctly — to 
use the technical terminology, they do not 
synapse. Such asynapsis results in the death of 
sperm-cell progenitors, although the mecha- 
nism underlying this process is not well 
understood. Second, transcription from the 
sex chromosomes is not inhibited as it is in the 
meiosis of healthy testes.

Both of these defects can prevent normal 
sperm development. The extent of the damage 


is regulated by interaction between the different 
alleles of Prdm9 and another gene, Hstx2, that 
are present in each subspecies. But an under-
standing of which diverging sequences prevent
synapsis, and of the molecular mechanisms 
underlying the roles of Prdm9 and Hstx2, has 
remained out of reach. 

Davies et al. now report considerable 
progress in defining the role of Prdm9 in hybrid 
sterility. They focused on a structure called the
Cys₂His₂ zinc-finger domain of mouse PRDM9,
which is responsible for the protein’s ability to
bind to specific DNA sequences. DNA bind- 
ing enables PRDM9 to add methyl groups to a 
lysine amino-acid residue on the protein his- 
tone H3, around which DNA is packaged in the 
nucleus. Such methylation is a prerequisite for 
the production of double-strand breaks (DSBs) 
in nearby DNA; these breaks are subsequently 
repaired through meiotic recombination, with 
the unbroken equivalent chromosome (the 
homologous chromosome) used as a genetic 
template for repair.

The authors engineered a ‘humanized’ form 
of Prdm9, in which they replaced the zinc-fin- 
ger domain from M. m. domesticus Prdm9 with 
the equivalent sequence from the human gene, 
which binds to a different DNA sequence. The 
humanized male mice were fertile but dis- 
played a pattern of recombination hotspots that 
shared little overlap with the hotspots of wild- 
type M. m. domesticus mice. Davies and col- 
leagues then crossed the humanized males to 
M. m. musculus females. The hybrids were
fertile, unlike hybrids from wild-type 
M. m. domesticus males, and the develop- 
ment of their sperm was normal. This finding 
clearly shows that it is the zinc-finger domain 
of PRDM9 that is responsible for sterility in 
hybrid mice. 

Analysis of DSB sites in the sperm-cell 
progenitors of infertile hybrids revealed that 
most DSBs occurred in asymmetric hotspots 
— that is, the PRDM9 protein produced from 
the set of chromosomes from one parent 
bound preferentially to sites on the homolo- 
gous chromosome. This asymmetry arises 
because of an evolutionary quirk known as 
the hotspot paradox. In this paradox, if any 
mutation that arises in a hotspot prevents 
PRDM9 from binding to that site, it is pref- 
erentially transmitted to the next generation, 
because the mutated sequence will be used as 
a template to repair DSBs in the unmutated 
hotspot on the homologous chromosome. 
In this way, PRDM9 binding sites are gradu-
ally lost over time. The asymmetry develops
because these sites are not lost from closely 
related subspecies, in which PRDM9 binds a 
different sequence. 
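
The erosion of binding sites by the hotspot paradox can be illustrated with a deterministic allele-frequency recursion. This is a minimal sketch under textbook assumptions (random mating, a very large population, no other selection) and is not a model taken from Davies and colleagues; the function name and the `conversion_bias` parameter are hypothetical. If a heterozygote transmits the binding-disrupting allele with probability (1 + b)/2, because breaks on the intact copy are repaired from the mutated template, then the frequency q of that allele changes each generation by b·q·(1 − q).

```python
def disrupted_site_trajectory(initial_frequency, conversion_bias, generations):
    """Frequency of a PRDM9-site-disrupting allele under biased transmission.

    Heterozygotes pass on the disrupted allele with probability
    (1 + conversion_bias) / 2, so with random mating
        q' = q**2 + 2*q*(1 - q)*(1 + b)/2 = q + b*q*(1 - q).
    All parameter values used here are illustrative, not measured.
    """
    q = initial_frequency
    trajectory = [q]
    for _ in range(generations):
        q = q + conversion_bias * q * (1.0 - q)
        trajectory.append(q)
    return trajectory


# Hypothetical numbers: a rare disrupting mutation with a 5% transmission advantage.
frequencies = disrupted_site_trajectory(0.01, 0.05, 1000)
for generation in (0, 100, 200, 400, 1000):
    print(f"generation {generation:4d}: frequency {frequencies[generation]:.4f}")
```

In this sketch the disrupting allele spreads even though it confers no benefit on the organism, which is the sense in which binding sites are gradually lost from the lineage whose own PRDM9 variant recognizes them.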

Because humans diverged from mice long 
before the two mouse subspecies diverged, the 
hotspots to which the humanized Prdm9 bound
were mostly symmetrical; that is, the protein 
bound to the same sites on both sets of mouse 
chromosomes. The authors conclude that the 
fertility of the humanized mice is associated 
with this rescue of hotspot symmetry. In support
of this assertion, [...] with which meiotic
asynapsis occurred in each chromosome pair. 
Thus, reduced PRDM9 binding symmetry is 
linked to impaired DSB repair and predisposi- 
tion to asynapsis. 

Davies and colleagues’ results provide 
considerable insight into how hybrid steri- 
lity arises in this particular model. However, 
hybrid sterility as a mechanism to prevent 
gene exchange between related species is a 
widespread phenomenon throughout the 
animal and plant kingdoms, and the general
significance of Prdm9 or Prdm9-like genes for 
the process of reproductive isolation is still 


Resolution beyond 
the diffraction limit 


A method has been devised that extends the resolution of X-ray crystal structures 
beyond the diffraction limit. This might help to improve the visualization of 
structures of proteins that form ‘poorly diffracting’ crystals. SEE LETTER P.202 


JIAN-REN SHEN 


In 1913, Lawrence Bragg described the law
that characterizes the diffraction of X-ray
beams by crystalline materials. Since
then, crystallographers have used this law
to solve the atomic structures of a wealth of
compounds, ranging from simple table salt to
extremely complex biomacromolecules. The
resolution at which a structure can be visual-
ized depends on the diffraction limit (or the
Bragg limit), which in turn depends on the
regular arrangement of molecules within crys-
talline materials. The degree of this regularity
is often low in crystals of biomacromolecules,
and this creates a major barrier to visualiz-
ing these molecules at the atomic level. On
page 202 of this issue, Ayyer et al. report an
approach to solving molecular structures at
resolutions higher than the diffraction limit.

The X-ray diffraction defined by Bragg’s law
is the sum total of diffraction patterns from
molecules that strictly occupy the positions of 
a crystal’s lattice units. Obtaining such a strictly 
ordered crystal often represents a big obstacle 
to the study of hydrophobic, membrane- 
spanning proteins and their complexes, limit- 
ing the resolution at which the structures of 
these biologically important molecules can be 
solved. The low degree of order in protein crys- 
tals can be caused by translational and rota- 
tional displacements of the protein molecules 
from their ideal positions, and by dynamic 
motions within the molecules. 
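
For reference, Bragg's law in its standard textbook form relates the X-ray wavelength, the spacing between lattice planes and the glancing angle between the incident beam and those planes. The symbols below are the conventional ones and are not defined in the article itself. The second expression gives the smallest spacing that can be resolved when Bragg spots are measurable only out to a maximum angle, which is one common way to express the diffraction limit.

```latex
% Bragg's law: constructive interference from lattice planes of spacing d
n\lambda = 2d\sin\theta, \qquad n = 1, 2, 3, \dots

% Smallest resolvable spacing when spots are observed only up to \theta_{\max}
d_{\min} = \frac{\lambda}{2\sin\theta_{\max}}
```

A disordered crystal loses its high-angle Bragg spots first, so the largest measurable angle falls and the smallest resolvable spacing grows.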

Ayyer and colleagues propose that 
molecules in crystalline materials can be 
considered as rigid units, and that the disorder 
is caused by translational displacements of 
these units from their crystal-lattice positions. 
These displacements cause scattered X-rays 
to deviate from the trajectories generated by a 
perfectly ordered crystal lattice, giving rise to 
uncertain. The next step will be to extend the
authors’ work, both to analyse some of the 
other Prdm9 alleles that have been identified 
in various mouse subspecies and to consider
other mammalian models of hybrid sterility. 
Only then will we be able to evaluate the fasci- 
nating possibility that Prdm9 has a recurrent 
role in reproductive isolation between closely 
related mammalian species.
a vague pattern of light and shadowy regions,
instead of a sharp diffraction pattern with dis-
tinct spots (see Figure 1 of the paper). This
vague picture constitutes a kind of ‘continuous’
diffraction pattern, and contains informa-
tion about the structure of the molecules in the 
crystals at resolutions extending well beyond 
the limit of Bragg diffraction. 

Continuous diffraction has frequently been 
observed in diffraction patterns of protein 
crystals, but had not previously been used 
for structural determination. Ayyer and co- 
workers’ approach is similar to that used for 
single-molecule imaging to obtain structural
information from the continuous diffraction 
pattern, working on the basis that the diffrac- 
tion arises from the sum of aligned single- 
molecule diffraction patterns. 

The authors’ method requires both Bragg 
and continuous diffraction data, and treats 
these diffractions separately. First, the Bragg 
diffractions were analysed using conven- 
tional techniques, generating an electron- 
density map of the target molecule at low 
resolution. The researchers used this map to 
generate a low-resolution outline of the 3D 
molecular structure. This, in turn, was used 
to constrain a higher-resolution 3D image 
of the electron density, generated from the 
continuous diffraction data using an ‘itera-
tive phasing’ algorithm. The structure was
finally refined using a procedure similar to 
that used for single-particle cryo-electron 
microscopy data.
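
The idea of iterative phasing constrained by a low-resolution outline can be sketched with the classic error-reduction scheme, used here as a stand-in because the article does not specify which algorithm the authors ran. In this one-dimensional toy, the "outline" is a known support (the region where the object may be non-zero) and the "continuous diffraction" is a set of Fourier magnitudes without phases. The function names, the test object and the use of a naive transform are all assumptions made for illustration.

```python
import cmath
import random


def dft(values, inverse=False):
    """Naive discrete Fourier transform (O(n^2)); adequate for a toy example."""
    n = len(values)
    sign = 1 if inverse else -1
    result = []
    for k in range(n):
        total = sum(v * cmath.exp(sign * 2j * cmath.pi * idx * k / n)
                    for idx, v in enumerate(values))
        result.append(total / n if inverse else total)
    return result


def error_reduction(magnitudes, support, n_iter=100, seed=0):
    """Alternate between the measured magnitudes and the real-space support."""
    rng = random.Random(seed)
    n = len(magnitudes)
    estimate = [rng.random() if support[i] else 0.0 for i in range(n)]
    errors = []
    for _ in range(n_iter):
        spectrum = dft(estimate)
        # Mismatch between the current and the measured Fourier magnitudes.
        errors.append(sum((abs(s) - m) ** 2
                          for s, m in zip(spectrum, magnitudes)) ** 0.5)
        # Fourier-space step: keep the current phases, impose measured magnitudes.
        constrained = [m * s / abs(s) if abs(s) > 0 else complex(m)
                       for s, m in zip(spectrum, magnitudes)]
        # Real-space step: keep the real part inside the support, zero elsewhere.
        candidate = dft(constrained, inverse=True)
        estimate = [candidate[i].real if support[i] else 0.0 for i in range(n)]
    return estimate, errors


# Hypothetical object: a small bump occupying positions 3 to 7 of 16.
true_object = [0.0] * 16
true_object[3:8] = [1.0, 2.0, 3.0, 2.0, 1.0]
support = [3 <= i < 8 for i in range(16)]
measured = [abs(c) for c in dft(true_object)]

recovered, errors = error_reduction(measured, support)
print(f"Fourier-magnitude mismatch: first iteration {errors[0]:.3f}, "
      f"last iteration {errors[-1]:.3g}")
```

Each pass can only keep or reduce the magnitude mismatch, which is the defining property of error reduction. Practical phasing codes typically use more elaborate update rules to avoid stagnation, and they work in three dimensions.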

Ayyer et al. used this approach to extend the 


Figure 1 | Electron-density maps of a section of photosystem II. Ayyer et al. report a method
that improves the resolution of X-ray crystal structures by analysing information from both conventional
Bragg diffraction and continuous diffraction, which occurs when molecules in crystals deviate from
their exact lattice positions only through translational displacements. a, Here, the structure of an α-helix
chain of photosystem II, a photosynthetic protein complex, is superimposed on the electron-density
map (grey mesh) of the chain obtained by analysing only Bragg X-ray diffraction. The resolution is 4.5 Å.
b, Using the authors’ method, the resolution of the electron-density map is improved to 3.5 Å, and the
molecular structure fits more precisely into the map.


resolution of the structure of photosystem II
(PSII), an extremely large, dimeric membrane-
protein complex (the molecular weight of the
dimer is about 700 kilodaltons). PSII splits
water into electrons, protons and oxygen
during photosynthesis, thereby sustaining
aerobic life on Earth by providing oxygen
and converting light energy into biologically
useful chemical energy. The structure of PSII
has previously been solved at a resolution of
1.9 ångströms using X-rays from a conven-
tional synchrotron source, and at a resolu-
tion of 1.95 Å using femtosecond X-ray pulses
(1 femtosecond is 10⁻¹⁵ s) from a free-electron
laser (XFEL). These structures indicated that
the PSII molecules are rigid within the crystals,
at least up to this level of resolution. However,
some of Ayyer and colleagues’ PSII crystals dif-
fracted to only 4.5 Å resolution, limiting the
authors’ ability to study the complex’s reaction
dynamics.

To obtain better resolution, the research- 
ers collected a data set from a stream of 
micrometre-scale PSII crystals using power- 
ful femtosecond XFEL pulses, and thus 
obtained an X-ray structure of 4.5 Å based on
Bragg diffractions. They then refined this to
3.5 Å resolution using their method based
on continuous diffraction data. The resulting 
electron-density map showed more-detailed 
features of many amino-acid side chains 
(Fig. 1) and cofactors associated with the large 
protein complex, compared with the structure 
derived from Bragg diffraction data alone. This 
great improvement hints at the potential of this 
approach for extending resolution limited by 
Bragg diffraction. 

The extent to which rotational displacements 


and dynamic motions in crystals can gener- 
ally be ignored remains to be seen — this will 
determine how useful continuous diffraction 
data will be for improving resolutions. Nev- 
ertheless, the authors’ approach might find 
broad applications, from structural analysis 
beyond Bragg diffractions to single-mole- 
cule imaging, and help to improve the accu- 
racy of structures derived from X-ray data. 
In particular, the technique is potentially a
great step forward for those seeking high- 
resolution structural information for many 
‘poorly diffracting’ protein crystals and their 
complexes. 

It will be interesting to see how much further 
resolution can be improved by incorporating 
continuous diffraction data with X-ray data 
collected from both XFEL and synchrotron 
sources. Meanwhile, the reported iterative- 
phasing approach may allow structures to be 
solved without the use of additional methods 
to retrieve phase information from diffraction 
data. It is also to be hoped that the reported 
iterative-phasing algorithms will be made 
user-friendly and widely available.
Exoplanets hidden


in the gaps 


Planets develop from the disk of dust and gas that surrounds a newly formed star. 
Observations of gaps in the disks of four such systems have allowed us to start 
unravelling the processes by which planets form. 


PAUL HO 


The brightest objects in the night sky,
besides the Moon, are the inner planets
of the Solar System: because they are
close to us, the light they reflect from the 
Sun is brighter than that of the stars. But exo- 
planets are as distant as stars, and are there- 
fore much dimmer, making them difficult to 
detect directly even with the biggest telescopes. 
Moreover, their companion stars are usually 
a billion times brighter than they are, and 
blurring and broadening of the stellar image 


by Earth’s atmosphere typically hides the 
nearby exoplanets. Writing in Astronomy & 
Astrophysics, van der Marel et al. report that
they have overcome these problems to acquire 
images of protoplanetary disks — the dust and 
gas surrounding a newborn star, from which 
planets form. Their observations show that 
nascent exoplanets have apparently begun to 
clear disk material from the inner zones of 
their systems. 

To understand how planets form, we need 
to find systems in which the protoplanetary 
disk is detectable so that the relationship 
Figure 1 | The young star HL Tauri. This image reveals that rings and gaps are present in the disk of dust
and gas surrounding the star. The gaps were probably opened and maintained by unseen planets. Van der
Marel et al. have detected gas in the central cavities of four other protoplanetary disks.


between the disk’s evolution and planetary 
formation can be studied. The interaction 
between a planet and its surrounding disk 
can provide clues. The best model for such 
systems may be the planetary rings around 
Saturn or Jupiter. The gaps between rings are 
maintained by ‘shepherd’ moons in the gaps: 
gravitational interactions and the exchange 
of angular momentum between a moon and 
the disk open a gap and preserve it. The gaps
are much easier to detect than the shepherd 
moons, because the gaps have a considerably 
larger area, and because there is a large contrast 
between the gap and the rings. 

In 2015, a protoplanetary disk around 
the star HL Tauri was imaged by the Ata-
cama Large Millimeter/submillimeter Array
(ALMA) — the largest ground-based obser- 
vatory ever built, with a radio telescope that 
can detect the cold dust in disks with great 
sensitivity. The images showed that rings and 
gaps are present in the disk around HL Tauri 
(Fig. 1), just as they are around Saturn. Until 
now, this provided the best evidence of a sys- 
tem in which multiple planetary bodies had 
formed in a protoplanetary disk. Van der 
Marel and colleagues have now used ALMA 
to observe a sample of ‘transitional’ disks, 
which form around solar-type stars and have a 
central cavity in which growing planets might
be starting to clear the protoplanetary material. 

The authors used ALMA’s exceptionally
high angular resolution to resolve the images, 
and used spectral lines associated with rare 
carbon isotopes in carbon monoxide (CO) 
to distinguish this gas in the disks. Locating 
the gas in disks is important for our under- 
standing of planetary formation, because gas 
is typically 100 times more abundant than 
dust in the interstellar medium, and typically 
a few times less abundant than that in proto-
planetary disks. The authors observed the
dust by studying its continuum emission (a 
large range of wavelengths, rather than just 
the narrow ranges of spectral lines), and made 
their observations at submillimetre wave- 
lengths to suppress the interference from the 
companion star's light. 

On the basis of their CO measurements, 
the authors find that the central cavity in 
the dust in each studied system is not totally 
empty, but contains gas. More specifically, 
they observe that the regions depleted of dust 
have radii ranging from 25 to 60 au (1 au is
the distance from Earth to the Sun), whereas
the regions depleted of gas have smaller radii
(7 to 30 au). Within the dust cavity (the region
depleted of dust, but not of gas) there is at
least 10,000 times more gas than dust. The
researchers propose that the depletion of gas 
and dust in the cavities could be an effect of 
heating by electromagnetic radiation from 
the central star, or of perturbation by unseen 
shepherd planets. 
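
To give a sense of the timescales implied by these cavity radii, the sketch below applies Kepler's third law to a body on a circular orbit at each quoted radius. It assumes a central star of exactly one solar mass (the article says only that the stars are solar-type) and neglects the mass of the orbiting body; the function name is hypothetical.

```python
def orbital_period_years(semi_major_axis_au, stellar_mass_solar=1.0):
    """Kepler's third law in units of au, years and solar masses.

    P^2 = a^3 / M, neglecting the mass of the orbiting body.
    """
    return (semi_major_axis_au ** 3 / stellar_mass_solar) ** 0.5


# Radii quoted for the gas-depleted (7 to 30 au) and dust-depleted (25 to 60 au) regions.
for radius_au in (7, 25, 30, 60):
    period = orbital_period_years(radius_au)
    print(f"{radius_au:>3} au: about {period:.0f} years per orbit")
```

Under these assumptions, material at the outer edge of the largest dust cavity takes several centuries to complete one orbit, so any clearing by an unseen planet is slow by human standards.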

A planet's gravitational potential can open 
a gap in a disk by inducing orbital resonances
— regular, periodic gravitational interactions 
with the disk material. But the viscosity of the 
gas in the disk allows transfer of the disk mater- 
ial’s angular momentum away from the reso- 
nance points; this transfer enables the gas to 
refill the gaps. The interaction of gas with the 
planets can give rise to: inward migration of 
those planets; a balance of angular-momentum 
transport between the planets and the disk; 
and the continual accretion of material onto 
the planets. 

The detailed physics of the exchange of 
angular momentum and energy between the 
disk and a planet is complicated. However, 
calculations have shown that, with judicious
choices of boundary conditions and disk 
viscosity, the relative sizes of the gas and dust 
cavities provide a means of measuring the mass 
of the possible planetary shepherd. By meas- 
uring the properties of the gas in the central 
cavities of the four observed protoplanetary 
disks, van der Marel and co-workers estimate 
that the typical mass of an unseen planet is 
about 10 times that of Jupiter. For comparison, 
the average disk mass of the observed systems 
is about 50 times that of Jupiter. Intriguingly, 
the dust cavities are usually about the size of 
Neptune’s orbit. This implies that the planetary
formation processes occurring in such systems 
might be similar to those that occurred in the 
Solar System. 

The current study is just the beginning of 
an upcoming avalanche of ALMA studies. The 
ability to measure spectral lines, such as those 
of CO, is crucial because it allows the meas- 
urement of motions within gaseous disks. Van 
der Marel and colleagues’ results clearly show 
rotation of the disks. More-sensitive data will 
allow us to see the motions and the structures 
of gas inflows. The results will begin to reveal 
how planets assemble in solar systems like our 
own.
Re-engineering the zinc fingers of
PRDM9 reverses hybrid sterility in mice


Benjamin Davies, Edouard Hatton, Nicolas Altemose, Julie G. Hussin, Florencia Pratto, Gang Zhang, Anjali Gupta Hinch,
Daniela Moralli, Daniel Biggs, Rebeca Diaz, Chris Preece, Ran Li, Emmanuelle Bitoun, Kevin Brick, Catherine M. Green,
R. Daniel Camerini-Otero, Simon R. Myers & Peter Donnelly


The DNA-binding protein PRDM9 directs positioning of the double-strand breaks (DSBs) that initiate meiotic recombination 
in mice and humans. Prdm9 is the only mammalian speciation gene yet identified and is responsible for sterility 
phenotypes in male hybrids of certain mouse subspecies. To investigate PRDM9 binding and its role in fertility and meiotic 
recombination, we humanized the DNA-binding domain of PRDM9 in C57BL/6 mice. This change repositions DSB hotspots 
and completely restores fertility in male hybrids. Here we show that alteration of one Prdm9 allele impacts the behaviour of 
DSBs controlled by the other allele at chromosome-wide scales. These effects correlate strongly with the degree to which 
each PRDM9 variant binds both homologues at the DSB sites it controls. Furthermore, higher genome-wide levels of such 
‘symmetric’ PRDM9 binding associate with increasing fertility measures, and comparisons of individual hotspots suggest 
binding symmetry plays a downstream role in the recombination process. These findings reveal that subspecies-specific
degradation of PRDM9 binding sites by meiotic drive, which steadily increases asymmetric PRDM9 binding, has impacts 
beyond simply changing hotspot positions, and strongly support a direct involvement in hybrid infertility. Because such 


meiotic drive occurs across mammals, PRDM9 may play a wider, yet transient, role in the early stages of speciation. 


In spite of its central role in evolution, the molecular mechanisms
underlying speciation are not well understood. Only a small number
of genes involved in speciation have been documented, with only one
such gene, Prdm9, known in mammals. Prdm9 contributes to hybrid
sterility in male (PWD × B6)F1 mice from crosses between male Mus
musculus domesticus C57BL/6 (hereafter B6) and female Mus musculus
musculus PWD/Ph (hereafter PWD). Although its genetic basis is only
partially understood, this hybrid sterility is characterized by failure of
pairing (synapsis) of homologous chromosomes and an arrested meiotic
prophase owing to lack of repair of recombination intermediates.
Homologous recombination and synapsis are interdependent, essen-
tial meiotic processes, and evidence suggests synapsis often nucleates
at recombination sites. Aside from the PWD × B6 cross, Prdm9 allele
and dosage have been associated with variation in measures of fertility
and successful meiosis in many additional mouse crosses.

PRDM9 has several functional domains, including a DNA-binding
zinc-finger array, and a PR/SET domain responsible for histone H3
lysine 4 trimethylation (H3K4me3) (Fig. 1a). By binding to spe-
cific DNA sequence targets, PRDM9 directs the positions of the DSB
events that initiate meiotic recombination. This results in these DSBs
and downstream recombination events clustering into small discrete
regions called hotspots. The PRDM9 zinc-finger array, encoded by a
minisatellite repeat, is highly polymorphic within and across mammalian
species and is among the fastest evolving regions in the genome,
with strong evidence of natural selection influencing this evolution. It
is unknown whether PRDM9 zinc-finger array polymorphism has addi-
tional impacts, aside from direct alterations of DSB hotspot positions.


Humanizing Prdm9 restores hybrid fertility 
To explore the DNA-binding characteristics of PRDM9, we generated
a line of humanized B6 mice, by replacing the portion of mouse Prdm9


exon 10 encoding the zinc-finger array with the orthologous sequence
from the human reference PRDM9 allele (the ‘B’ allele) (Fig. 1a,
Extended Data Fig. 1). A feature of PRDM9 (explored further below)
is the co-evolution of its zinc-finger array with the genomic background
in which it sits. Minisatellite mutational processes at PRDM9 can
produce new alleles with duplications, deletions or rearrangements
within the zinc-finger array, yielding an almost complete change in
PRDM9 binding sites, and thus hotspot locations. Because the human
PRDM9 zinc-finger array evolved on a lineage separated from mice
for ~150 million years, our experimental approach allows assessment
of functional properties of a PRDM9 zinc-finger array unaffected by
changes it has induced in the background genome, similar to new alleles
randomly arising in the population.

Humanization of the Prdm9 zinc-finger array in B6 inbred mice had
no effect on fertility (Extended Data Fig. 2) and cytogenetic compar-
isons revealed no significant impact on zygotene DSB counts (DMC1
immunoreactivity, Extended Data Fig. 2b), crossover counts (MLH1
foci, Extended Data Fig. 2c), normal sex body formation (γH2AX
immunostaining, Fig. 1b) or quantitative measures of fertility and
successful synapsis (see later). The full fertility of humanized mice
implies there are unlikely to be any specific essential PRDM9 binding
sites. One mechanism underlying speciation in many settings involves
Dobzhansky-Muller incompatibilities: hybrid dysfunction arising
from incorrect epistatic interactions. Based on the above, it seems
likely that if such interactions involving PRDM9 occur, they do not
reflect constrained co-evolution of Prdm9 with specific genes.

To explore the role of PRDM9 in fertility directly, we crossed PWD
females with B6^B6/H males. As expected, male (PWD × B6)F1^PWD/B6
hybrids (we use superscripts to indicate Prdm9 genotypes and write
the female strain first in crosses) exhibited hybrid sterility as evidenced
by failures in siring pups (Extended Data Fig. 2e), sex body formation
Figure 1 | Humanizing the zinc-finger domain of PRDM9 does not
impact fertility. a, Domain structure of the re-engineered PRDM9
protein. b, γH2AX staining of the sex body (green), SYCP3 staining of
the chromosome axis (red) in late pachytene in B6^B6/B6 (top) and B6^H/H
(bottom). c, As b, but for (PWD × B6)F1^PWD/B6 and (PWD × B6)F1^PWD/H.
d, SYCP1 staining of the synaptonemal complex transverse filament
(green), and SYCP3 staining of the chromosome axis (red) in pachytene
for (PWD × B6)F1^PWD/B6 and (PWD × B6)F1^PWD/H. Arrows, unsynapsed
autosomes. Scale bars, 10 μm.


(Fig. 1c) and synapsis (Fig. 1d). In contrast, all these defects were 
completely rescued in (PWD × B6)F1^PWD/H hybrids inheriting
the engineered humanized zinc-finger array (Fig. 1c, d, Extended 
Data Fig. 2e). Thus the zinc-finger domain of PRDM9, and hence 
probably the DNA-binding properties of this protein, underlies the 
role of Prdm9 in hybrid sterility. 

Although (PWD × B6)F1^PWD/B6 male mice are completely sterile,
the male progeny of the reciprocal cross (B6 × PWD)F1^B6/PWD
are semi-fertile. A particular 4.7 Mb locus (Hstx2) on the PWD
X chromosome influences these fertility differences. We also tested
the impact of humanization in this reciprocal cross, and full fertility 
(from semi-fertility) was again restored (see below and Supplementary 
Information). Thus humanization of PRDM9 acts at least partially 
independently of Hstx2. 

Our reprogramming of the PRDM9 binding sites mimics the conse- 
quences of mutational changes in its zinc-finger array. The restoration 
of hybrid fertility suggests that the same rescue is likely to occur for 
newly arising alleles that also reset PRDM9 binding sites, and hence 
hybrid sterility between subspecies driven by Prdm9 will be evolu- 
tionarily transient. This raises the question, which we return to below, 
of what properties are possessed by Prdm9 alleles that are associated 
with reduced fertility. 


Humanizing the recombination landscape 

To characterize the consequences of re-engineering the zinc-finger 
domain on recombination, we generated high-resolution DSB maps 
for mice with different Prdm9 alleles and genomic backgrounds, 
using ChIP-seq single-stranded DNA sequencing on adult testes. 
This approach identifies single-stranded 3′ sequence ends decorated 
with DMC1, which arise as intermediates following creation of DSBs 
by SPO11. In addition to mapping DSB hotspots, our hotspot-calling 
algorithm estimates a hotspot ‘heat’ proportional to the fraction of 
cells marked by DMC1 at that locus (Supplementary Information). 
This DMC1 heat depends on both the relative frequency of DSB 
formation and on how long DMC1 marks persist. We also obtained 
complementary information by performing ChIP-seq to measure 
H3K4me3, a histone modification directly introduced in cis by 
PRDM9 binding. 

Relative to wild-type B6 mice, B6^H/H mice showed completely 
changed hotspot landscapes (2.6% overlap; Extended Data Fig. 3), 
with hotspots in the humanized mouse showing strong enrichment 
for a motif matching the previously reported human PRDM9 
binding motif (Extended Data Fig. 4). Most DSB hotspots overlapped 
H3K4me3 peaks (89%, P < 0.05). Correlation between the wild-type 
and humanized mice in total DSB heats increased over larger genomic 
scales (Extended Data Fig. 3b), consistent with earlier studies showing 
that large-scale crossover rates depend on factors other than PRDM9. 

In the heterozygous mouse, despite the presence of two different 
Prdm9 alleles, we found a similar number of hotspots to homozygous 
mice (Supplementary Table 1). Furthermore, almost all B6^B6/H 
hotspots (95.8%) were found in either the B6^B6/B6 or B6^H/H mice 
(Extended Data Figs 3c, 4c, Supplementary Table 2). The human allele 
exhibited 2.7-fold dominance over the wild-type allele (Supplementary 
Table 2), with even stronger dominance for hotter hotspots (Extended 
Data Fig. 3c). Comparison of homozygous and heterozygous hotspot 
heats (Extended Data Fig. 3d, e) implies B6 hotspots operate similarly, 
but are proportionally less active, in the heterozygote. For additional 
DSB hotspot analyses, see Supplementary Information and Extended 
Data Fig. 5. 




Humanization restores symmetric binding 

Next we examined DSB hotspot maps for hybrid males: infertile 
(PWD × B6)F1^PWD/B6, reciprocal semi-fertile (B6 × PWD)F1^B6/PWD, 
humanized rescue (PWD × B6)F1^PWD/H, and reciprocal humanized 
rescue (B6 × PWD)F1^H/PWD, with wild-type PWD for comparison. 
Sequence differences between the PWD and B6 genomes allowed 
us to determine whether individual hotspots in these hybrids were 
‘symmetric’, with DSBs occurring equally on both chromosomes, or 
‘asymmetric’, with a preference towards either the PWD or B6 
chromosome (Supplementary Information Section 5). 
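
The distinction between symmetric and asymmetric hotspots can be illustrated with a minimal sketch. It assumes that each hotspot comes with counts of allele-informative reads assigned to the B6 and PWD chromosomes, and it borrows the thresholds used later in the individual-hotspot analysis (more than 90% asymmetry; within 10% of complete symmetry). The function names, the reading of "within 10%" as a B6 fraction between 0.4 and 0.6, and the "intermediate" class are illustrative assumptions; the procedure actually used is described in Supplementary Information Section 5.

```python
def b6_fraction(b6_reads: float, pwd_reads: float) -> float:
    """Fraction of allele-informative signal assigned to the B6 chromosome."""
    total = b6_reads + pwd_reads
    if total <= 0:
        raise ValueError("hotspot has no allele-informative reads")
    return b6_reads / total


def classify_hotspot(b6_reads: float, pwd_reads: float,
                     sym_window: float = 0.10,
                     asym_cutoff: float = 0.90) -> str:
    """Label a hotspot by how evenly its signal splits between homologues.

    sym_window and asym_cutoff are assumed thresholds, not values taken
    from the Supplementary Information.
    """
    f = b6_fraction(b6_reads, pwd_reads)
    if abs(f - 0.5) <= sym_window:
        return "symmetric"
    if f >= asym_cutoff:
        return "asymmetric (B6)"
    if f <= 1 - asym_cutoff:
        return "asymmetric (PWD)"
    return "intermediate"


if __name__ == "__main__":
    # Hypothetical read counts, for illustration only.
    for counts in [(50, 50), (95, 5), (3, 97), (75, 25)]:
        print(counts, classify_hotspot(*counts))
```

A real analysis would also need to handle hotspots with few informative reads, where the estimated fraction is noisy.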

We found that most DMC1 signal (71.8%) in (PWD × B6)F1^PWD/B6 
or (B6 × PWD)F1^B6/PWD hybrids occurs within asymmetric DSB 
hotspots (Fig. 2a, Extended Data Fig. 6). Further, DSBs associated 
with the PWD allele occur largely on the B6 chromosome and those 
associated with the B6 allele occur largely on the PWD chromosome. 
We also measured asymmetry of the H3K4me3 mark at each hotspot 
and found the same pattern, confirming that DSB asymmetry largely 
reflects underlying differences in PRDM9 binding and methylation 
between the two homologues. This H3K4me3 asymmetry resembles 
that previously described for (B6 × CAST)F1^B6/CAST hybrids, but is 
considerably more extreme. Sequence differences directly disrupting 
PRDM9 binding motifs explain almost all cases of binding asymme- 
try (83.4% of PWD hotspots; 91.3% of B6 hotspots), and result from 
rapid mutational accumulation along the separate lineages from the 
common ancestor of B6 and PWD (Extended Data Fig. 6g). 

Such asymmetry can arise through meiotic drive to favour muta- 
tions disrupting PRDM9 binding motifs, within populations where 
these motifs are active. Any new mutation disrupting PRDM9 
binding at a hotspot is preferentially transmitted to offspring: in 
individuals heterozygous for the mutation, DSBs occur preferentially 
on the non-mutant chromosome and are then repaired by copying 
from the mutant chromosome. This phenomenon has been observed 
at PRDM9 binding motifs in human and mouse and causes a 
rapid accumulation of mutations disrupting PRDM9 binding. B6 and 
PWD Prdm9 alleles are largely subspecies-specific, so only the B6 
lineage has experienced strong erosion of the B6 binding motif, and 
only the PWD lineage has experienced strong erosion of the PWD 
binding motif. This asymmetric erosion explains the highly asymmetric 
PRDM9 binding sites in F1 hybrids. 

Because the human allele has not been present in mice, its bind- 
ing sites have not experienced erosion in the mouse genome. As a 
consequence, DSBs at hotspots attributable to the human allele occur 
mostly (57%) in symmetric hotspots, with the remaining, asymmetric 
hotspots mainly (84.2%) explained by the presence of mutations that 
coincidentally fall within the human PRDM9 binding motif (Fig. 2b). 
Conversely, only 30% of DSBs at hotspots attributable to the B6 allele 
occur in symmetric hotspots. An identical pattern is seen in the recip- 
rocal crosses. Thus, a genome-wide effect of humanizing the mouse 
is to reprogram hotspot positions with the consequence that hotspot 
asymmetry is reduced in the hybrids. 

Meiotic drive might also explain dominance, as seen for the human 
Prdm9 allele over the B6 allele in the B6^B6/H mouse, because B6 motifs 
are heavily eroded on the B6 background. To test this, we created F2 
mice to analyse the behaviour of the B6 and humanized alleles on 
a neutral Mus musculus castaneus (CAST/EiJ) background, which 
has been unaffected by B6 motif erosion (Extended Data Fig. 6h). 
Dominance of the human allele disappeared in regions of the genome 
with two copies of the CAST genome—removing the effect of motif 
erosion removes the dominance (Fig. 2c). This result excludes 
some factors which might influence dominance (Supplementary 
Information), and also suggests that recently arisen Prdm9 alleles 
might be dominant over older alleles, for which meiotic drive will 
have had more time to degrade binding motifs. 


Chromosome-specific trans effects of humanization 

The infertile and humanized rescue mice share some hotspots, 
controlled by the PWD allele. These shared hotspots show strong 
correlation (r² = 0.63) in DMC1 heat, but nevertheless far weaker 
than that between hotspots in the infertile and reciprocal mice (0.95). 
To explore this weaker correlation, we compared DMC1 heats in 
the two mice for each shared hotspot and calculated their ratio. 
We observed substantial differences in these ratios across different 
chromosomes (Fig. 2d). Thus, replacing the B6 allele with the human 
allele impacts hotspots that neither allele binds directly, in trans, and 
this impact is observed at broad genomic scales. This trans effect might 


reflect differences in either the formation, or downstream processing, of 
DSBs. In contrast to DMC1, the H3K4me3 heat showed no significant 
chromosomal ratio differences (Fig. 2e), implying that the trans effect 
probably operates downstream of PRDM9 binding. Furthermore, comparison 
of DMC1 heats between B6^B6/B6 and B6^H/H mice also revealed 
chromosome effects (Extended Data Fig. 7). This implies that such 
trans effects do not depend on SNP presence (the B6 background is 
fully homozygous), and cannot simply be a consequence of asynapsis 
(observed only in the infertile mouse). 
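
The chromosome-level comparison described above can be sketched as follows. The sketch assumes a list of shared hotspots, each with a chromosome label and a DMC1 heat in the two mice being compared, and summarizes the per-hotspot ratios by their median on each chromosome. The median, the function name and the input layout are assumptions for illustration; the estimator and confidence intervals actually used are given in the Supplementary Information.

```python
from collections import defaultdict
from statistics import median


def chromosome_heat_ratios(shared_hotspots):
    """Summarize per-hotspot DMC1 heat ratios by chromosome.

    shared_hotspots: iterable of (chrom, heat_mouse_a, heat_mouse_b).
    Returns {chrom: median of heat_mouse_a / heat_mouse_b}.
    Hotspots with a non-positive heat in either mouse are skipped.
    """
    ratios = defaultdict(list)
    for chrom, heat_a, heat_b in shared_hotspots:
        if heat_a > 0 and heat_b > 0:
            ratios[chrom].append(heat_a / heat_b)
    return {chrom: median(values) for chrom, values in ratios.items()}


if __name__ == "__main__":
    # Hypothetical heats (infertile, rescue), for illustration only.
    hotspots = [
        ("chr1", 2.0, 1.0),
        ("chr1", 4.0, 1.0),
        ("chr1", 3.0, 1.0),
        ("chr2", 1.0, 2.0),
    ]
    print(chromosome_heat_ratios(hotspots))
```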

Next, we sought to understand the drivers of these chromosome-specific 
differences in DMC1 heat by testing various potential 
predictors of these differences between the infertile and humanized 
rescue mice (Supplementary Information). After an exhaustive search 
over possible models, given the predictors considered, the best-fitting 
model was highly predictive (r² = 0.84; Fig. 2f) and included only 
symmetric hotspot measures (the total H3K4me3 signal from PRDM9 
binding on both homologues, that is, symmetrically, at the same 
hotspots, summed over the entire chromosome) for each of the three 
Prdm9 alleles (P < 0.01 in each case). The trans effect is thus explained 
by knowledge of only the direct differences in PRDM9 binding tar- 
gets across mice, without any additional information regarding other 
features such as SNP diversity, consistent with the sole difference 
between the infertile and rescue mice being the zinc-finger array of 
Prdm9. Moreover, only symmetric hotspots (in the infertile mouse, a 
minority) provide predictive power. 

The fitted model implies that lower overall symmetric binding 
results in increased DMC1 heat, at a chromosome level. The same 
properties (P < 0.0002; Supplementary Information) hold true in 
the comparison between B6^B6/B6 and B6^H/H mice. Although the B6 
background is completely homozygous, so PRDM9 is predicted to 
mark H3K4me3 equally on both homologues, different total levels of 
H3K4me3 marking across chromosomes still occur and these correlate 
with observed differences in DMC1 heat between the two genotypes. 


11 FEBRUARY 2016 | VOL 530 | NATURE | 173 


© 2016 Macmillan Publishers Limited. All rights reserved 




[Figure 3 graphic not reproduced. Panel a, per-genotype values for: symmetry metric (×10⁻⁵); percentage of cells with normal sex bodies (%); percentage of cells with autosomes properly synapsed (%); paired testes weight per gram of lean weight (mg g⁻¹). Panel b, observed DMC1 ratio (infertile:rescue) against synapsis rate in infertile (%) for five chromosomes.]

Figure 3 | Humanizing PRDM9 restores proper synapsis and rescues fertility in hybrids. a, Fertility metrics in hybrid mice. Error bars represent bootstrap 
95% confidence intervals (symmetry metric), or ±1 s.e. (other metrics). b, Chromosome effects in DMC1 signals (as Fig. 2d) versus previously reported 
asynapsis rates for five chromosomes in infertile (PWD × B6)F1^PWD/B6. Error bars, ±1 s.e. 


This excludes sequence differences at or near hotspots, or asynap- 
sis itself, as a cause, and suggests that the total amount of symmetric 
binding on each chromosome, as opposed to a simple lack of asym- 
metric binding, plays an important role in predicting DMC1 heat. 
The direction of causality is reasonably clear (binding predates DSB 
formation, and the H3K4me3 mark lacks similar chromosome effects), 
while confounding influences should always be shared between the 
mice being compared and thus cannot alone explain the observed 
inter-chromosomal differences. It therefore appears that differences in 
the level of overall symmetric binding by PRDM9 drive downstream 
trans effects at chromosomal scales, with lower symmetric binding 
somehow increasing the number, or repair time, of DSBs even at dis- 
tant hotspots. 


PRDM9 binding symmetry and synapsis 

Sterile (PWD × B6)F1^PWD/B6 hybrids show very high rates of asynapsis, 
particularly at specific chromosomes, and failure to form the sex body 
during early meiosis. By contrast, these phenotypes are completely 
rescued in (PWD × B6)F1^PWD/H hybrids harbouring the humanized 
Prdm9 allele (Fig. 1c, d). Having seen a relationship between PRDM9 
binding symmetry and the recombination process, we examined 
binding symmetry in relation to fertility. For different male mice, we 
measured three quantitative fertility phenotypes (Fig. 3a), and calculated 
several genome-wide measures of hotspot symmetry (Extended 
Data Fig. 8; Supplementary Information). We observed a significant 
correlation (P=0.0083; rank correlation permutation test) between the 
DMC1 symmetry measures and the rate of proper synapsis among all 
nine mice studied. In humanized hybrid mice, the observed increase 
in symmetry was accompanied by improved fertility. Notably, this 
improvement effect is stronger than the Hstx2 modifier, responsible for 
the difference in asynapsis and fertility observed between the sterile and 
reciprocal hybrids (Fig. 3a). An additional mouse hybrid, (B6 × CAST)F1^B6/CAST, 
showed intermediate PRDM9 binding symmetry and also 
an intermediate asynapsis level. Symmetry measures in homozygous 
mice (PWD, B6^B6/B6, B6^B6/H, B6^H/H) are, as expected, much higher than in 
hybrids, and these mice show the highest synapsis rates and fertility 
measures. 

Previous work showed that in the infertile (PWD × B6)F1 mouse, 
synapsis failure occurs at different rates among five chromosomes 
tested. We compared the reported asynapsis rates for these five chro- 
mosomes with the chromosome-specific DMC1 heat effects described 
above and found an identical ranking (P=0.017 by rank correlation 
permutation test; Fig. 3b). Because these DMC1 heat effects are 
strongly predicted by symmetric H3K4me3 levels in the infertile 
mouse, this result implies that chromosomes with lower symmetric 
PRDM9 binding experience higher asynapsis rates. This may explain 
why lower symmetric PRDM9 binding genome-wide accompanies 
higher overall asynapsis rates among different mice. 
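
With only five chromosomes there are 5! = 120 possible rank orderings, so an identical ranking is the strongest agreement such a comparison can show. The sketch below gives one way a rank correlation permutation test can be computed exactly. It assumes a two-sided test on Spearman's coefficient with no ties, and the input values are hypothetical placeholders rather than measurements from this study. Under those assumptions, a perfectly concordant ranking of five items gives P = 2/120 ≈ 0.017; the exact procedure used for the reported values is described in the Supplementary Information.

```python
from itertools import permutations


def ranks(values):
    """Ranks 1..n for a sequence without ties."""
    order = sorted(values)
    return [order.index(v) + 1 for v in values]


def spearman_rho(x, y):
    """Spearman rank correlation for equal-length sequences without ties."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))


def exact_rank_permutation_p(x, y):
    """Two-sided exact permutation P value for Spearman's rho.

    Enumerates every ordering of y, so it is only practical for small n.
    """
    observed = abs(spearman_rho(x, y))
    all_orders = list(permutations(y))
    as_extreme = sum(
        abs(spearman_rho(x, order)) >= observed - 1e-12 for order in all_orders
    )
    return as_extreme / len(all_orders)


if __name__ == "__main__":
    # Hypothetical per-chromosome values with identical rankings.
    dmc1_effect = [1.05, 1.10, 1.20, 1.35, 1.50]
    asynapsis_rate = [12.0, 20.0, 35.0, 60.0, 85.0]
    print(round(exact_rank_permutation_p(dmc1_effect, asynapsis_rate), 4))
```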

Having found elevated DMC1 heat on chromosomes influenced by 
asynapsis (where homologous pairing fails), we examined DMC1 and 
H3K4me3 heats in two additional settings, where no homologue exists 
at all and thus homologous chromosome pairing cannot occur: the X 
chromosome in male mice, and separately in humanized hybrid mice 
at autosomal hotspots where the human PRDM9 binding motif lies 
within a region deleted in the PWD genome. In both these settings, 
we observed an elevation of DMC1 heat relative to autosomal hotspots 
bound symmetrically by PRDM9 (Extended Data Fig. 9). Elevation of 
DMC1 heat might, therefore, be a consistent signature of non-pairing 
of homologous chromosomes during meiosis. DMC1 elevation might 
be explained by an increased probability of a DSB occurring at that 
site, or by the DMC1 coating at breaks persisting for longer (delayed 
repair). However, the total number of RAD51-marked DSBs initiated 
per cell is tightly regulated, remaining unchanged even in Prdm9 
knockouts, while in both knockouts and infertile hybrids, DSB marks 
indeed persist late into pachytene, suggesting a failure of repair. 
Therefore, the elevated DMC1 signals we observe may be explained 
by persistence of DMC1 where homologous repair is compromised 
or delayed. 


PRDM9-dependent homologue interactions 
Given our chromosome-scale observations, we next asked whether 
symmetric binding at individual hotspots might also influence DMC1 
heat. At each human-controlled hotspot in the humanized rescue, we 
measured the component of total DMC1 heat contributed by the B6 
chromosome only, and compared this to the DMC1 heat for the same 
hotspot in B6^H/H. The comparison revealed (Fig. 4a) a remarkably 
strong, and clear, elevation in DMC1 heat in the hybrid mouse for the 
asymmetric hotspots (>90% asymmetry, towards binding of only the 
B6 chromosome), relative to the symmetric hotspots (those within 10% 
of complete symmetry). However, similar to the chromosomal analysis, 
H3K4me3 enrichment showed no difference whatsoever between 
symmetric and asymmetric sites in these mice (Fig. 4b). Indeed a 
comparison of H3K4me3 and DMC1 heat revealed a far higher (Fig. 4c) 
ratio of average DMC1 heat to H3K4me3 enrichment for asymmetric 
relative to symmetric hotspots, across all hybrid mice, backgrounds, 
and Prdm9 alleles tested (Extended Data Fig. 9d). This effect reflects 
a consistent elevation of DMC1 heat at DSB sites on individual 
chromosomes when the homologue is not bound strongly (Extended 
Data Fig. 9e, f). This phenomenon cannot easily be explained by factors 
including local heterozygosity within or outside the PRDM9 motif, 
the type of mutation(s) disrupting PRDM9 binding, or outlier effects 
(Extended Data Figs 9, 10; Supplementary Information Section 13). 
Thus, elevation of DMC1 heat on the bound chromosome appears 
to be a universal feature of hotspots where PRDM9 binds asymmet- 
rically, relative to symmetrically bound hotspots. By contrast, the 
results for H3K4me3 suggest the mark is deposited in an independent 
manner on each homologue (Supplementary Information Section 12.2). 
This implies the DMC1 heat elevation depends on a process involving 
symmetric PRDM9 binding, downstream of H3K4me3 deposition, 
involving both homologues. While we cannot exclude the possibility 
that somehow more DSBs occur at asymmetric hotspots, this would 
require early, precise pairing of homologues, at least at hotspots, before 
DSB formation, to determine which hotspots are symmetrically bound. 
Although there is some evidence of pre-meiotic homologue association, 
current data do not suggest the existence of precise pairing 
before DSB formation. The alternative and more plausible explanation 
is that sites where PRDM9 binds asymmetrically simply experience 
a delay in DSB processing, delaying DMC1 removal compared to 
symmetric DSB hotspots. While our data represent the collective 
behaviour of populations of cells, this model suggests a mechanism 
of PRDM9-dependent interaction between homologues influencing 
downstream DSB processing operating within individual cells, which 
we discuss below (also Supplementary Information Section 14). 


Discussion 

Only one mammalian speciation gene, Prdm9, has so far been iden- 
tified. Humanizing the zinc-finger array of Prdm9 redirects binding, 
thereby entirely reprogramming recombination hotspots, and in doing 
so reverses the hybrid infertility between musculus and domesticus sub- 
species. This modification mimics the consequences of a newly arising 
allele and thus suggests that Prdm9 evolution (for example, rapid fixation 
of particular existing variants or novel alleles arising by mutation) 
in either or both subspecies would also restore hybrid fertility. 

Multiple lines of evidence in our data, at chromosomal, whole- 
organism, and individual hotspot scales, strongly suggest novel roles 
for PRDM9 in the formation or processing of DSBs downstream of 
H3K4me3 deposition, dependent upon symmetric binding. Several 
aspects of our, and published, data (comparison between B6^B6/B6 and 
B6^H/H mice, see also Supplementary Information) also mean that 
our results cannot be fully explained simply by sequence differences 
within or around hotspots, which do not specifically impact binding 
symmetry. 

Pervasive asynapsis is proposed to be the underlying cause of infertility 
in hybrid mice. We observed a positive relationship between 
symmetric PRDM9 binding and correct synapsis of homologous chro- 
mosomes later in meiosis. Replacing the B6 allele with the human- 
ized allele in hybrids greatly increases symmetric binding, restoring 
proper synapsis and fertility. Many apparently complex relationships 
have previously been reported between naturally occurring mouse 
Prdm9 alleles, allelic dosage, and quantitative fertility measures in 
hybrids. Each of ten manipulations shown or predicted to increase 
PRDM9 binding symmetry also increases meiotic success and fertil- 
ity (Supplementary Information), supporting the idea that the link 
between binding symmetry and fertility might be very general, and 
causal. 

The erosion of PRDM9 binding sites through meiotic drive also 
occurs at human hotspots, and probably across many mammals. In 
two populations separated for sufficient time, differential PRDM9 
binding site erosion will decrease symmetry in hybrids, which is likely 


for H3K4me3. c, Comparison of DMC1 signals in (PWD × B6)F1^PWD/B6, 
at symmetric and asymmetric hotspots, binned by H3K4me3 enrichment 
(medians shown). H3K4me3 and DMC1 signals are estimated on the PWD 
chromosome only, for hotspots associated with B6 PRDM9. 


to decrease fertility levels (though not necessarily to the extreme of 
sterility). Therefore, PRDM9 may affect hybrid fertility levels across 
many mammalian species and so might repeatedly act in driving 
early speciation steps, although the rapid evolution of the zinc-finger 
array of PRDM9 implies an unexpected transience of this direct role. 
However, even subtle or transient PRDM9-driven reductions in 
fertility might still provide a selective advantage to additional muta- 
tions contributing towards speciation. This mechanism is different 
from the previously characterized causes of intrinsic hybrid incompat- 
ibilities, such as differences in ploidy, chromosomal rearrangements, 
or incompatibilities between genes. The extent to which it has been 
responsible for speciation in the natural world appears an interesting 
question for further research. 

One plausible mechanism for the impacts of (a)symmetry involves 
a role for PRDM9 binding in aiding homology search—a process 
thought to involve invasion of the homologous chromosome to probe 
for homology by single-stranded DNA formed around DSBs. It 
has been suggested that synaptonemal complex proteins are loaded 
at some DSB sites and synapsis begins to spread. Extending this 
model, to incorporate the property that asymmetrically bound sites 
are less favourable for homology search, would parsimoniously 
predict each symmetry-related phenomenon we observed: DSBs at 
asymmetric hotspots would repair more slowly, elevating their DMC1 
signal, and chromosomes with fewer symmetric hotspots overall 
would show delayed DSB repair and higher asynapsis rates, ultimately 
causing subfertility or sterility in animals with low symmetric bind- 
ing. It is not known how homology search occurs efficiently in the 
nuclear environment, given the enormous potential search space of 
the genome, or why hotspots exist at all. Both phenomena could be 
explained by the above model in which homology search is focused 
at least partly on hotspot positions. Indeed hotspots might massively 
increase search efficiency by directing homology search to PRDM9 
binding sites. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Gene targeting in embryonic stem cells. A C57BL/6J (B6) mouse genomic BAC 
clone (RP23-159N6) encompassing the Prdm9 gene was used for subcloning of 
homology regions. A 7 kb XmaI/SpeI fragment upstream of exon 10 and a 2.5 kb 
BamHI/SpeI fragment downstream of exon 10 were used as 5′ and 3′ homology 
regions, respectively. The intervening 4 kb SpeI/BamHI fragment encoding exon 10 and 
flanking intronic regions was subcloned and an internal 1.4 kb BglII-NheI fragment, 
containing the coding region of the zinc-finger array, was replaced with a 
synthesized fragment (Life Technologies) encoding the zinc-finger array from the 
human B allele. All coding sequence 5’ of the first zinc finger and all 3’ untranslated 
regions (UTR) downstream of the stop codon were left as mouse. This human- 
ized fragment was then assembled between the two homology arms, upstream of 
a neomycin selection cassette. PhiC31 attP sites were incorporated immediately 
downstream of the 5’ homology arm and between the PGK promoter and the 
neomycin phosphotransferase open reading frame to equip the locus with PhiC31 
integrase cassette exchange machinery for subsequent manipulations. 

The completed targeting vector was linearized with ApaI and electroporated 
into mycoplasma-free C57BL/6N JM8F6 embryonic stem cells (Extended Data 
Fig. 1a). JM8F6 cells were a gift from B. Skarnes, Wellcome Trust Sanger Institute. 
Following selection in 210 μg ml⁻¹ G418, recombinant clones were screened by 
PCR to detect homologous recombination over the 3′ arm. A forward primer 
(5′-TACCGGTGGATGTGGAATGTG-3′) binding within the PGK promoter was 
used together with a reverse primer (5′-TGACAGCAAAAACCACCTCTA-3′) 
binding downstream of the 3′ homology arm to amplify a 2.7 kb fragment 
from correctly recombined clones. Positive clones were examined for correct 
recombination at the 5′ end by long range PCR using a forward primer 
(5′-CAGAGGACCTTTAGTCTGTGAGGG-3′) binding upstream of the 5′ homology 
arm and a reverse primer (5′-AGCAGAGGCTTGACCTATCGCTAA-3′) 
binding within the humanized region. Correctly targeted clones yielded a 10.4 kb 
amplicon. Sanger sequence analysis of the 10.4 kb amplicon encompassing the 5′ 
homology arm with primer 5′-CCTTTCTCAATGATCCACAAAT-3′ confirmed 
the correct integration of the 5’ attP sequence, necessary for future manipulations 
of the locus. Southern blotting using a probe against neomycin was used to confirm 
that only a single integration event had occurred. 
Mouse production and matings. Mice were housed in individually ventilated 
cages and received food and water ad libitum. All studies received local ethical 
review approval and were performed in accordance with the UK Home Office 
Animals (Scientific Procedures) Act 1986. Experimental groups were determined 
by genotype and were therefore not randomized, with no animals excluded from 
the analysis. Sample size for fertility studies and cytogenetics (see below) were 
selected on the basis of previously published studies. No statistical methods 
were used to predetermine sample size. All phenotypic characterization was 
performed blind to experimental group. 

ES cells from correctly targeted clones were injected into albino C57BL/6J 
blastocysts and the resulting chimaeras were mated with albino C57BL/6J 
females. Successful germline transmission yielded black pups and F1 mice 
harbouring the humanized Prdm9 allele were identified using the above 
attP screening PCR. F1 heterozygous male mice were bred with C57BL/6J 
Flp recombinase deleter mice (Tg(ACTB-Flpe)9205Dym; Jax stock 
005703) and offspring were screened for the deletion of the selection cassette 
using a forward primer (5′-TTCTGCCATCACTTCCTTCGGTGA-3′) 
binding immediately upstream of the cassette and a reverse primer 
(5′-TCTGAAGCCCAACTATTTCATTAATACCCC-3′) binding immediately 
downstream of the cassette. A 677-bp amplicon was obtained from the Flp-deleted 
humanized allele and a 491-bp amplicon was obtained from the wild-type allele. 
Heterozygous humanized mice without the selection cassette were then backcrossed 
with C57BL/6J to remove the Flp transgene before intercrossing to obtain 
experimental cohorts of heterozygous, homozygous and wild-type mice which 
were genotyped with the above PCR. PWD/PhJ mice were a gift from J. Forejt, 
Institute of Molecular Genetics, Prague, Czech Republic and CAST/EiJ were 
sourced from MRC Harwell. 

Fertility was assessed in male mice between the ages of 2 and 4 months by 
recording the average number of pups obtained when bred with 7-week-old wild-type 
C57BL/6J female mice. Paired testes weight was recorded and normalized 
against lean body weight, as assessed using EchoMRI-100 Small Animal Body 
Composition Analyzer. 

Immunohistochemistry analyses. Spermatocytes from mice at approximately 9 
weeks of age were prepared for immunohistochemistry by surface spreading. 
In brief, the testis tunica was removed, the tubules cut with a razor blade and disassembled 
by pipetting, in PBS, containing protease inhibitors (Complete, Roche). 
Following centrifugation at 5,800 g for 5 min, the cells were resuspended in 0.1 M 
sucrose, and spread onto the surface of slides in a drop of 1% paraformaldehyde 
in PBS. The slides were left to dry for 3 h at room temperature, in a humidified 
box, then washed in 0.4% Photo-Flo 200 (Kodak), and either used immediately, 
or stored at −80 °C. For immunohistochemistry the following antibodies were 
used: mouse anti-MLH1 (BD, 51-1327GR); mouse anti-phospho-H2A.X (Millipore 
05-636, clone JBW301); rabbit anti-SYCP1 (Novus Biological, NB300-229); 
rabbit anti-DMC1 (Santa Cruz Biotechnology sc-22768, H-100); mouse anti- 
SYCP3 (Santa Cruz Biotechnology sc-74569, D-1); rabbit anti-SYCP3 (Abcam 
ab15093). Non-specific binding sites were blocked by incubating the cells with 
0.2% BSA, 0.2% gelatin, 0.05% Tween-20 in PBS (B/ABD buffer). Cells were 
incubated with the primary antibodies overnight at 4°C. Following washes in 
B/ABD buffer and detection with secondary antibodies, the slides were mounted 
in DAPI/Vectashield (Oncor) and analysed with an Olympus BX60 microscope 
for epifluorescence, equipped with a Sensys CCD camera (Photometrics, USA), 
using Genus Cytovision software (Leica). 

Spermatocytes were staged based on SYCP3 staining. For MLH1 analysis, only 
pachytenes with 19 or more foci, co-localizing with SYCP3, were considered, 
according to criteria defined by ref. 35. For DMC1 analysis, randomly selected cells, 
from any stage, were scored. The number of DMC1 foci per cell was counted using 
the PointPicker macro in ImageJ (64 bit). For SYCP1 analysis, only cells in pachytene 
were considered. Cells with 19 fully synapsed autosomes, with co-localizing 
SYCP1 and SYCP3 signals, and one XY body, were considered normal. For 
characterization of γH2AX, cells in pachytene or diplotene were scored, and we 
considered normal those where only a clearly identifiable XY body was covered 
by yH2AX signal. 

Prdm9 expression via RT-PCR analysis. To verify the correct expression 
of the humanized Prdm9, we performed exon-spanning end-point 
RT-PCR on whole testis cDNA prepared using Tetro reverse 
transcriptase (Bioline) using a forward primer binding to exon 9 
(5′-CATTAAGTGGGGAAGCAAGA-3′) and a reverse primer binding within the 3′ 
UTR, immediately downstream of the humanized zinc-finger domain 
encoded by exon 10 (5′-GGGATTTAATTCCCTTTTCTAGTCA-3′) 
(Extended Data Fig. 1b). Q-PCR analysis of Prdm9 transcripts was performed 
using two primer pairs (5′-GAATGAGAAAGCCAACAGCA-3′ and 
5′-GGACAACCAGACTGCACAGA-3′; 5′-AGCCAACAGCAATAAAACCA-3′ 
and 5′-GGGATTTAATTCCCTTTTCTAGTCA-3′), amplifying regions 
within the 3′ UTR, normalizing against a housekeeping gene (Hprt; 
5′-AGCTACTGTAATGATCAGTCAACG-3′ and 5′-AGAGGTCCTTTTCACCAGCA-3′) 
using the Power SYBR Green PCR Master mix (Applied 
Biosystems) and a BioRad CFX96 cycler as per the manufacturer’s instructions. 
Relative expression was calculated using the Livak method. Expression of the 
humanized Prdm9 allele was unaffected by the genetic manipulation (Extended 
Data Fig. 1b, c). 
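
For readers unfamiliar with the Livak method, relative expression is computed as 2^(−ΔΔCt), where ΔCt is the threshold cycle of the target minus that of the reference gene, and ΔΔCt is the ΔCt of the test sample minus that of the control sample. The sketch below shows the arithmetic only; the function name and the Ct values are hypothetical and are not taken from this study.

```python
def livak_fold_change(ct_target_test: float, ct_ref_test: float,
                      ct_target_ctrl: float, ct_ref_ctrl: float) -> float:
    """Relative expression by the 2^(-ddCt) (Livak) method.

    ct_target_*: threshold cycle of the gene of interest (here, Prdm9).
    ct_ref_*:    threshold cycle of the reference gene (here, Hprt).
    Assumes roughly 100% amplification efficiency for both amplicons.
    """
    delta_test = ct_target_test - ct_ref_test
    delta_ctrl = ct_target_ctrl - ct_ref_ctrl
    delta_delta = delta_test - delta_ctrl
    return 2.0 ** (-delta_delta)


if __name__ == "__main__":
    # Hypothetical Ct values: identical dCt in both samples gives a ratio of 1,
    # which is what unchanged expression looks like.
    print(livak_fold_change(24.0, 20.0, 24.0, 20.0))
```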

Single-stranded DNA sequencing and double-strand break (DSB) detection. 
Testis cells from B6^B6/H, B6^H/H, wild-type PWD, the infertile (PWD × B6)F1^PWD/B6, 
the reciprocal semi-fertile (B6 × PWD)F1^B6/PWD, the humanized rescue 
(PWD × B6)F1^PWD/H, (B6 × CAST)F1^B6/CAST, (B6/CAST)F2°°" males were 
subjected to single-stranded DNA sequencing (SSDS) as previously described'’. 
In addition, we used the sample C57BL/6 (sample 1) from ref. 21 aligned to mm9/ 
NCBI37. This sample was also re-mapped to mm10/NCBI38 with a modified BWA 
mapper!’. Other samples from ref. 21, 9R (sample 2), 13R (samples 1 and 2) and 
Prdm9 knockout (B6^−/−) (sample 1)'°, were also used in the comparative analysis 
of DSB maps (Extended Data Fig. 3e). B6^B6/H and B6^H/H libraries were prepared 
in D. Camerini-Otero’s lab (NIH) and sequenced on a HiSeq 2000 platform, using 
paired-end reads (read 1: 36 bp; read 2: 40 bp). These samples were aligned to the 
mouse mm9/NCBI37 reference genome. 

Wild-type PWD, the infertile (PWD × B6)F1^PWD/B6, the reciprocal semi-fertile 
(B6 × PWD)F1^B6/PWD, the reciprocal rescue (PWD × B6)F1^PWD/H, (B6 × CAST)F1^B6/CAST 
and (B6/CAST)F2°° samples were prepared in The Wellcome Trust Centre 
for Human Genetics and sequenced on HiSeq 2000 and HiSeq 2500 platforms, 
using paired-end reads (51 bp for each read). These samples were aligned to 
the mouse mm10/NCBI38 reference genome with a modified BWA mapper’. 
Variation in the number of sequenced fragments results from the difficulty of 
precisely assessing the DNA concentration before sequencing. Only fragments with 
high mapping quality (at least 20) were retained for DSB hotspot calling, and only 
one copy of each duplicate fragment was kept (here, a fragment is a duplicate 
if at least one other fragment maps to the same genomic position). 
Supplementary Table 1 gives details about the samples considered in this study. 
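
The two fragment filters described above (mapping quality of at least 20, and one copy per duplicated position) can be sketched as below. This is an illustration only, not the authors' pipeline: the `Fragment` record, the function name, and the choice of (chromosome, start, end) as the definition of "same genomic position" are assumptions, as is applying the quality filter before de-duplication.

```python
from typing import Iterable, List, NamedTuple


class Fragment(NamedTuple):
    chrom: str
    start: int
    end: int
    mapq: int


def filter_fragments(fragments: Iterable[Fragment], min_mapq: int = 20) -> List[Fragment]:
    """Keep high-quality fragments and one copy of each duplicated position."""
    seen = set()
    kept = []
    for frag in fragments:
        # Drop fragments below the mapping-quality threshold.
        if frag.mapq < min_mapq:
            continue
        # Assumed duplicate key: identical chromosome, start and end.
        key = (frag.chrom, frag.start, frag.end)
        if key in seen:
            continue
        seen.add(key)
        kept.append(frag)
    return kept


example = [
    Fragment("chr1", 100, 400, 37),
    Fragment("chr1", 100, 400, 30),  # duplicate position, dropped
    Fragment("chr1", 500, 900, 10),  # low mapping quality, dropped
    Fragment("chr2", 100, 400, 20),
]
print(len(filter_fragments(example)))  # 2
```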
H3K4me3 ChIP-seq. ChIP-seq was performed as previously described** with 
several modifications (noted here). In brief, the testis tunica was removed, the 
tubules disassociated with tweezers and fixed in 1% formaldehyde in PBS for 5 min 
followed by glycine quenching (125 mM final conc.) for 5 min at room temperature. 
Following washing steps, pellets were resuspended in 900 µl cold RIPA lysis 
buffer, dounced 20 times and sonicated in 300 µl aliquots in a Bioruptor Twin 
sonication bath at 4°C for three 10-min periods of 30 s on, 30 s off at high power, 
then cell debris was pelleted and removed and aliquots were pooled. For each 
sample, 50 µl of equilibrated magnetic beads were resuspended in 100 µl PBS/BSA 
and added to the chromatin samples for pre-clearing for 2 h at 4°C with rotation. 
Beads were removed, and 100 µl of pre-cleared chromatin was set aside for the 
input control. 5 µl rabbit polyclonal anti-H3K4me3 antibody (Abcam ab8580) was 
added to the remaining pre-cleared chromatin and incubated overnight at 4°C with 
rotation. 50 µl beads were washed and resuspended as before, then incubated with 
the chromatin samples for 2 h at 4°C with rotation. Beads were then washed and 
de-crosslinked at 65°C as described*®, and for input controls, 50 µl of pre-cleared 
chromatin was used. After de-crosslinking, samples were further incubated with 
80 µg RNase A at 37°C for 60 min and then with 80 µg Proteinase K at 55°C for 
90 min. DNA was purified using a Qiagen MinElute reaction cleanup kit. 

ChIP and total chromatin DNA samples were sequenced in multiplexed paired-end 
Illumina libraries, yielding 51-bp reads. We prepared two biological replicates 
plus one genomic input control each for the infertile (PWD × B6)F1^PWD/B6, 
reciprocal (B6 × PWD)F1^B6/PWD, and rescue (PWD × B6)F1^PWD/H mice, yielding 
roughly 40-50 million usable read pairs per replicate. For the B6^B6/B6 and B6^H/H 
mice, we prepared one biological replicate each (yielding 70-80 million usable read 
pairs per sample) and later split read pairs into pseudoreplicates. Sequencing reads 
were aligned to mm10 using BWA aln*’ (v. 0.7.0) followed by Stampy*® (v. 1.0.23, 
option bamkeepgoodreads), and reads not mapped in a proper pair with insert size 
smaller than 10kb were removed. Read pairs representing likely PCR duplicates 
were also removed by samtools rmdup. Pairs for which neither read had a mapping 
quality score greater than 0 were removed. Fragment coverage was computed at 
each position in the genome and in 100-bp non-overlapping bins using in-house 
code and the samtools*’ and bedtools*’ packages. 
DSB hotspot detection and map comparison. To analyse DMC1 data, we developed 
a novel ChIP-seq peak caller, specific to DSB hotspots, which takes advantage 
of the shift in the mapping of single-stranded DNA (ssDNA) reads between the 
5′ and the 3′ DNA strands to call hotspots. These ssDNA segments are a consequence 
of the resection of DNA ends that accompanies a DSB and are isolated by 
DMC1 ChIP”. For each hotspot, the caller estimates in particular the centre of 
the hotspot, and its heat, loosely defined as the number of reads mapping to this 
DSB hotspot and predicted to represent real signal. The caller handles sample 
replicates and is able to call hotspots using several samples jointly. Details are 
given in Supplementary Information. DSB hotspots from two different samples 
are considered to overlap if their centres are at most 600 bp apart. DMC1 hotspot 
heats have been normalized so that the sum of hotspot heats is identical in each 
sample (and equals the sum of hotspot heats in B6^B6/B6 (sample 1)). 
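
The overlap rule and the heat normalization described above can be sketched as follows. This is a minimal illustration rather than the published caller: both function names are hypothetical, hotspot centres are assumed to lie on the same chromosome, and the target sum stands in for the summed heat of the reference sample.

```python
def hotspots_overlap(centre_a, centre_b, max_distance=600):
    """Two hotspots overlap if their centres are at most 600 bp apart.

    Assumes both centres are coordinates on the same chromosome.
    """
    return abs(centre_a - centre_b) <= max_distance


def normalize_heats(heats, target_sum):
    """Rescale heats so that they sum to target_sum."""
    total = sum(heats)
    return [h * target_sum / total for h in heats]


print(hotspots_overlap(10_000, 10_600))  # True
print(hotspots_overlap(10_000, 10_601))  # False
print(normalize_heats([1.0, 3.0], 8.0))  # [2.0, 6.0]
```

Rescaling every sample to the same total makes heats comparable between samples that were sequenced to different depths.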

H3K4me3 enrichments have been computed at DSB hotspots identified by 
DMC1 ChIP-seq, using our previously published method** (Supplementary 
Information Subsection 7.1). H3K4me3 hotspots have also been called de novo, 
without using DSB hotspots, using the same approach*®. The de novo calls were 
used to generate a list of regions likely to be trimethylated independently of 
PRDM9, by intersecting calls in mice with different Prdm9 alleles. In comparisons 
involving both DMC1 and H3K4me3 data, we excluded DSB hotspots contained 
in any of the PRDM9-independent trimethylated regions, and we used H3K4me3 
enrichments computed at DSB hotspots (Supplementary Information). We only 
used de novo calls for analysis in Extended Data Fig. 6d, e. 

DNA binding motif analyses. We developed a new, Bayesian, approach to identify 
DNA motifs enriched at DSB hotspots (Supplementary Information). We used 
FIMO (MEME Suite version 4.9.1) to find the locations of those motifs genome- 
wide. Using Mus famulus and Mus caroli as outgroups, we reconstructed an ances- 
tral reference genome for B6 and PWD. We could therefore identify on which 
lineage (B6 or PWD) mutations between these two mouse strains occurred. See 
Supplementary Information for details. 

DSB hotspot assignment in hybrids. Using SNPs between the B6 and PWD 
genomes, each read pair from a hybrid DSB library (DMC1 ChIP-seq) is assigned 
to one of the categories ‘B6’, ‘PWD’, ‘unclassified’ or ‘uninformative’ using criteria 
detailed in Supplementary Information. For each DSB hotspot, the ratio of 
informative reads from the B6 chromosome was then computed as the number of 
‘B6’ reads mapped within 1 kb of the hotspot centre, divided by the sum of ‘B6’ and 
‘PWD’ reads in that region. We followed a similar approach for H3K4me3 ChIP-seq, 
but we further corrected for background signal. 
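
The per-hotspot ratio described above can be sketched as below. This is an illustration under stated assumptions: each read pair is reduced to a single position and a category label, "within 1 kb" is taken as an inclusive distance of 1,000 bp, and the function name is hypothetical. The classification of read pairs and the background correction used for H3K4me3 are not shown.

```python
def b6_fraction(reads, hotspot_centre, window=1000):
    """Fraction of informative reads near a hotspot that come from B6.

    reads: iterable of (position, label) pairs, where label is one of
    'B6', 'PWD', 'unclassified' or 'uninformative'.
    Returns None when no informative read falls in the window.
    """
    b6 = pwd = 0
    for position, label in reads:
        if abs(position - hotspot_centre) > window:
            continue
        if label == "B6":
            b6 += 1
        elif label == "PWD":
            pwd += 1
    informative = b6 + pwd
    return b6 / informative if informative else None


example_reads = [
    (5_100, "B6"), (5_200, "PWD"), (4_700, "B6"),
    (5_050, "unclassified"), (9_000, "B6"),  # last read is outside the window
]
print(b6_fraction(example_reads, hotspot_centre=5_000))
```

For the example reads, two ‘B6’ reads and one ‘PWD’ read fall inside the window, so the ratio is 2/3.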

Chromosome effects. To test for statistically significant differential elevation 
of DMC1 (or H3K4me3) heats between chromosomes following Prdm9 
humanization of the infertile (PWD × B6)F1^PWD/B6 mice, we fitted a quasi- 
Poisson model to these heats, including predictors for each chromosome. 
Specifically, we estimated parameters $\alpha$, $\beta$ and $\gamma$ by fitting the model 
$$\log\left[E\left(d_{\text{infertile}} \mid d_{\text{rescue}}, c\right)\right] = \alpha + \gamma \log\left[d_{\text{rescue}}\right] + \sum_{i} \beta_i \mathbf{1}_{c=i},$$ 
where $d_{\text{infertile}}$ and $d_{\text{rescue}}$ are the DMC1 heats of a particular hotspot which is 
shared between the infertile (PWD × B6)F1^PWD/B6 and rescue (PWD × B6)F1^PWD/H 
mice and $c$ is a categorical variable which represents the chromosome on which 
the DSB hotspot occurs. Furthermore, for one of the hybrid mice we considered, for 
a given autosome, we defined the ‘total H3K4me3 signal from PRDM9 binding on both 
homologues (that is, symmetrically) at the same hotspots, summed over the entire 
chromosome’, also referred to as ‘the sum of symmetric heats’, as 
$\sum_i 4 r_i (1 - r_i) h_i$, where $r_i$ is the fraction of DMC1 reads coming from the B6 
chromosome for hotspot $i$, $h_i$ is the H3K4me3 heat of that hotspot, and the sum is 
taken over all the hotspots on that chromosome which are under the control of a 
specific (PWD, B6, or humanized) PRDM9. (Our analyses always refer to this sum of 
symmetric heats for a specific allele.) When we considered the B6 mouse (which of 
course has two B6 chromosomes), we defined this sum of symmetric heats to be 
$\sum_i h_i$ (which is the special case of the formula above with $r_i = \tfrac{1}{2}$, 
corresponding to all hotspots being fully symmetric). Under the assumptions we 
describe in the Supplementary Information, this can also be interpreted as being 
proportional to the expected number of hotspots with PRDM9 bound on both 
homologues. Details and motivations for defining this quantity are given in 
Supplementary Information Section 8, together with a slight adjustment we used in 
practice to provide robustness against outliers in the value of $h_i$. 
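
The sum of symmetric heats defined above can be sketched in a few lines. This is a minimal illustration of the stated formula only: the function name and the example values are hypothetical, and the outlier adjustment mentioned in the text is not included.

```python
def sum_symmetric_heats(b6_fractions, heats):
    """Sum over hotspots of 4 * r * (1 - r) * h.

    b6_fractions: per-hotspot fraction r of DMC1 reads from the B6 chromosome.
    heats: per-hotspot H3K4me3 heat h.
    """
    return sum(4.0 * r * (1.0 - r) * h for r, h in zip(b6_fractions, heats))


heats = [2.0, 3.0]
# Fully symmetric hotspots (r = 0.5) contribute their whole heat.
print(sum_symmetric_heats([0.5, 0.5], heats))  # 5.0
# Fully asymmetric hotspots (r = 0 or 1) contribute nothing.
print(sum_symmetric_heats([0.0, 1.0], heats))  # 0.0
```

The weight 4r(1 − r) equals 1 at r = 0.5 and falls to 0 as binding becomes confined to one homologue, which is why the homozygous B6 case reduces to the plain sum of heats.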

We proceeded similarly in the B6^B6/B6 versus B6^H/H comparison. The observed 
effects reported in Fig. 2d-f and Extended Data Fig. 7 are normalized to the effect 
for Chromosome 1. Precise definitions for the model, and for the 14 chromosome 
effect predictors tested, are given in Supplementary Information. 
Analysis code availability and source data. The analysis code used in this 
study is available at https://github.com/anjali-hinch/hybrid-rescue. The source 
data generated in this publication have been deposited in NCBI’s Gene Expression 
Omnibus (accession number GSE73833). 
Extended Data Figure 1 | Humanization of the zinc-finger domain of 
Prdm9. a, Top, the targeting vector used for the humanization of the zinc- 
finger array encoded by a portion of exon 10. Middle, wild-type Prdm9 
allele. Bottom, the targeted humanized allele, following the action of Flp 
recombinase which removes the FRT-flanked neomycin selection cassette. 
The positions of primers used for the exon-spanning RT-PCR are shown 
along with the sizes of the predicted amplification products from cDNA. 




b, RT-PCR analysis using the exon-spanning primers shown in a from testis 
cDNA prepared from wild-type (B6^B6/B6) and heterozygous humanized 
(B6^B6/H) mice. For gel source data, see Supplementary Fig. 1. c, Relative 
expression of the Prdm9 transcript from testis cDNA prepared from wild- 
type (B6^B6/B6), heterozygous (B6^B6/H) and humanized (B6^H/H) mice, 
normalized to Hprt (n = 2 for each genotype). Error bars, ±1 s.e. 
Extended Data Figure 2 | Effects of the humanization of the Prdm9 
zinc-finger domain on fertility parameters. a, The average litter size is 
shown for all combinations of genotype matings. Error bars, ±1 s.e. 
b, Numbers of DMC1 foci co-localizing with SYCP3 immunoreactivity per 
cell, grouped according to meiotic stage (wild-type (B6^B6/B6), n = 5 mice; 
heterozygous (B6^B6/H), n = 7 mice; homozygous (B6^H/H), n = 6 mice; cell 
numbers counted: zygotene: 32, 38, 37; zygotene/pachytene: 55, 96, 90; 
pachytene: 188, 210, 176; signals on XY in pachytene: 188, 210, 175 for 
B6^B6/B6, B6^B6/H and B6^H/H, respectively). Mean values are shown. c, Number 
of MLH1 foci per cell in pachytene stage meiotic spreads (B6^B6/B6: n = 6 
mice, 180 cells; B6^B6/H: n = 6 mice, 185 cells; B6^H/H: n = 6 mice, 183 cells). 
Mean values are shown. d, Comparison of fertility metrics in four mice 
with homozygous genetic background (B6 or PWD). Across all four mice, 
there is no statistically significant evidence of differences in these fertility 
parameters (ANOVA, Bonferroni-corrected P values > 0.08). Error bars, 
±1 s.e. e, Average litter sizes in F1 crosses. Error bars, ±1 s.e. 
Extended Data Figure 3 | Further features revealed by DMC1 signal 
analysis in mice with homozygous genetic background. a, Effect of 
humanization of the Prdm9 zinc-finger domain on DSB hotspots. A total 
of 16,225 and 17,517 DSB hotspots were localized in the homozygous 
humanized and wild-type mice, respectively. Only 2.6% of these hotspots 
overlap. b, Correlations between DSB hotspot maps at different scales. 
Autosomes are divided into bins of given length, and correlations between 
the sums of the heats of the hotspots falling into each bin are reported, for 
different bin sizes. The grey region indicates the empirical 95% confidence 
envelope for the correlation under the null hypothesis of no association 
between the B6^B6/B6 and B6^H/H DSB maps. DSB maps for B6^B6/B6, 13R, 9R 
and Prdm9 knockout (B6^−/−) mice come from ref. 21. B6^B6/B6 and 9R have the 
same Prdm9 allele, but different genomic backgrounds. c, Breakdown of 
hotspot provenance (defined by overlap) in the heterozygous humanized 
mouse for all DSB hotspots (left panel) and for the hottest 20% of hotspots 
(right panel). d, Distributions of hotspot provenance in the heterozygous 
humanized mice as a function of the estimated hotspot heats (blue, wild- 
type B6 mouse; red, humanized homozygous mouse; green, humanized 
heterozygous mouse; purple, undetermined). The human allele dominates 
over the mouse allele in terms of heat, as the proportion of DSB hotspots 
found in the heterozygous mouse that are shared with the homozygous 
humanized mouse increases with estimated heat. The relative heat/strength 
of a hotspot is the ratio of this hotspot’s estimated heat to the sum of all 
the estimated heats (on autosomes). e, Hotter hotspots present a PRDM9 
binding motif more often than weaker hotspots in all samples (same 
colour legend). 
Extended Data Figure 4 | Inferred PRDM9 binding motifs are enriched 
at DSB hotspot centres. a-d, Refined PRDM9 binding motifs detected 
in the wild-type B6 mouse (a), in the homozygous humanized mouse (b), 
in the heterozygous humanized mouse (c) and in wild-type PWD (d). 
Percentages above each motif indicate the fraction of DSB hotspots that 
are found to harbour this motif, with each DSB hotspot assigned to at most 
one motif. In logo plots, letter height in bits of information indicates the 
degree of base specificity. e-g, Enrichment of the most prevalent 15-bp 
wild-type (blue) and humanized homozygous (red) motifs within 
100-bp bins across a 5-kb window centred on the DSB hotspot centres. 
Enrichments were computed for the wild-type (e), homozygous 
humanized (f) and heterozygous humanized (g) mice DSB hotspots. 
Extended Data Figure 5 | Differential epigenetic mark distributions 
at PRDM9 binding motifs. a, Enrichment of H3K4me3 marks at mouse 
motifs that are either within a B6 (left) or human (right) PRDM9 allele 
controlled DSB hotspot, or outside such a hotspot. The enrichment is 
relative to a control genomic track. Given the spread of the distributions, 
the interaction range between the histones and the DSB hotspot seems to 
be ~1.5 kb on each side of the motif. b, As a, but for H3K36me3 marks. 
c, Mean coverage of H3K4me3 (left) or H3K36me3 (right) signal around 
the mouse motif nearest to each B6 DSB hotspot, split according to the 
strand on which the motif lies. d-h, As a, but for H3K9ac (d), H3K27ac 
(e), H3K27me3 (f), H3K4me1 (g) or H3K79me2 (h) marks. All ChIP-seq 
data for histone modifications used in this analysis were obtained from the 
Mouse ENCODE Project. 
Extended Data Figure 6 | Further features of DSB hotspot asymmetry. 
a-c, DSB hotspot asymmetry in (B6 × PWD)F1^B6/PWD and in (B6 × PWD)F1^H/PWD. 
a, Distribution of the fraction of (DMC1) informative reads 
originating from the B6 chromosome in the reciprocal (B6 × PWD)F1^B6/PWD 
mouse. PRDM9 control at each DSB is attributed either to the B6 allele 
(blue) or the PWD allele (pink) or is undeterminable (grey). b, c, As a, 
but showing fractions only for non-shared hotspots, unique to either the 
reciprocal (B6 × PWD)F1^B6/PWD (b) or the reciprocal rescue 
(B6 × PWD)F1^H/PWD (c) mice. d, e, Comparison of the levels of asymmetric 
binding in the (PWD × B6)F1^PWD/B6 and (B6 × CAST)F1^B6/CAST mice, using 
H3K4me3 signal. d, Distributions of the fraction of H3K4me3 reads from 
the B6 chromosome in the two mice. We used raw data from ref. 17 for the 
(B6 × CAST)F1^B6/CAST mouse, and processed both data sets in the same 
way. H3K4me3 heats were capped at the 95th percentile in each case, and 
only H3K4me3 binding peaks not inferred to be independent of PRDM9 
binding (Supplementary Information Section 7), and overlapping with a 
DMC1 hotspot in the same mouse, were considered. e, Quantile-quantile 
plot for the distributions shown in d (blue). Dark grey: y = x line; light 
grey: 95% confidence band. f, Density plot comparing, for each hotspot in 
the (PWD × B6)F1^PWD/B6 mouse, its DMC1 and H3K4me3 asymmetries. 
The correlation between the two measures is 0.93. g, Mutations within 
1-kb regions around B6 and PWD PRDM9 motifs, on the B6 and PWD 
genomes. Main plot, for each combination of motif and lineage (PWD 
or B6), we plot the fraction of 30-bp windows, along the 1-kb regions 
surrounding motif occurrences within DSB hotspots, where at least one 
SNP or indel mutation occurred along the respective lineage. Inset plot, 
distribution of motif score differences (derived minus ancestral) for motif 
changes shown in the main plot. Motif score was defined as the logarithm 
of the probability that a motif was drawn from the motif’s position weight 
matrix, in the ancestral sequence and in the current-day mouse. A negative 
difference indicates the motif match worsened along the corresponding 
lineage. This panel is based on the (PWD × B6)F1^PWD/B6 DMC1 map. 
h, Mutations within 1-kb regions around B6 PRDM9 motifs, on the B6 
and CAST genomes, as in g, using the (B6 × CAST)F1^B6/CAST DMC1 map. 
We see no evidence of erosion of B6 PRDM9 motifs on the CAST genome. 
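
The motif score used in panel g (the logarithm of the probability of a sequence under the motif's position weight matrix) and the derived-minus-ancestral difference can be sketched as below. This is an illustration only: the two-column matrix, the sequences and the function name are hypothetical, natural logarithms are assumed, and every matrix entry is assumed to be non-zero.

```python
import math


def motif_score(sequence, pwm):
    """Log-probability of a sequence under a position weight matrix.

    pwm: list with one dict per motif position, mapping each base to its
    probability (assumed non-zero, e.g. after adding pseudocounts).
    """
    if len(sequence) != len(pwm):
        raise ValueError("sequence and PWM must have the same length")
    return sum(math.log(column[base]) for base, column in zip(sequence, pwm))


# Hypothetical 2-bp motif that prefers 'A' then 'G'.
toy_pwm = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
]
ancestral, derived = "AG", "AC"
difference = motif_score(derived, toy_pwm) - motif_score(ancestral, toy_pwm)
print(difference < 0)  # True: the mutation weakened the motif match
```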
Extended Data Figure 7 | Chromosome effects following Prdm9 
humanization in B6^B6/B6. a, Individual chromosome effects (relative 
to chromosome 1) when comparing DMC1 signals in the B6^H/H mouse 
relative to the B6^B6/B6 mouse, for the DSB hotspots that are shared between 
these two mice. b, Comparison of the observed chromosome effects for 
DMC1 signals with the fitted chromosome effects, using the two-predictor 
model including the sum of symmetric H3K4me3 heats in B6^B6/B6 and in 
B6^H/H. Bars conservatively show 3 standard errors in both plots. 
Extended Data Figure 8 | Value by chromosome and sensitivity analysis 
for the symmetry metric. a, Symmetry metric, as defined in the main 
text, for each sample (ALL), and for each autosome amongst those 
samples. Error bars represent bootstrap 95% confidence intervals in all 
panels. b, Alternative symmetric metrics (to the ones reported in the main 
text), using only 10,000 hotspots per sample, or without weighting each 
chromosome specific metric, to compute the average metric genome- 
wide. Both metrics are computed using the DMC1 maps. c, Alternative 
symmetric metrics using H3K4me3 maps, similarly to b. The threshold of 
12,540 hotspots per sample corresponds to the number of hotspots with 
ratio estimates in the (PWD × B6)F1^PWD/H mouse, which was the lowest 
amongst the three samples shown here. 
Extended Data Figure 9 | Asymmetric hotspots, hotspots on the 
X chromosome and hotspots opposite deletions show systematic 
increase of DMC1 heat, relative to symmetric hotspots. a, For the 
PWD allele in the humanized rescue (PWD × B6)F1^PWD/H mouse, mean 
DMC1 signal is plotted in decile bins of H3K4me3 enrichment on the 
B6 chromosome (or the PWD X chromosome), with error bars showing 
95% confidence intervals and lines of best fit (as in Fig. 4c). The slope 
of the line for asymmetric hotspots is 2.5-fold greater than that of the 
symmetric hotspots, and the slope for hotspots on the X chromosome is 
5.2-fold greater, illustrating that the DMC1 signal at asymmetric sites is 
elevated in a similar fashion to hotspots on the X chromosome, which do 
not repair until late in meiosis. We found similar results in all cases tested. 
b, Comparison of DMC1 heats on B6 chromosome for hotspots shared 
between the humanized B6^H/H and the humanized rescue (PWD × B6)F1^PWD/H 
mice, under humanized PRDM9 control. We show symmetric 
hotspots (fraction of DMC1 informative reads between 0.4 and 0.6, 
green), and hotspots opposite a deletion on the PWD chromosome 
(deletion of at least 200 bp, encompassing a human PRDM9 binding motif, 
black). The black line represents the median DMC1 heat for symmetric 
hotspots. c, As b, but showing the asymmetric hotspots (fraction of 
DMC1 informative reads above 0.9, red), with the corresponding median 
line. Hotspots opposite a PWD deletion show a significant elevation in 
DMC1 heat relative to symmetric hotspots (14 of 16 hotspots above the 
symmetric median line, P = 0.004). This elevation is similar to the one 
shown by asymmetric hotspots (9 of 16 hotspots above the asymmetric 
median line, P= 0.80). d, Bar plot showing the genome-wide autosomal 
ratio of mean DMC1 heat to mean H3K4me3 enrichment for asymmetric 
hotspots relative to symmetric hotspots in 9 scenarios studied, each for 
a different combination of mouse, Prdm9 allele, and haplotype, with 
error bars representing 95% bootstrap confidence intervals for the ratio 
of means. In all cases, asymmetric hotspots show an elevation in DMC1 
signal for a given H3K4me3 signal. e, Ratio of mean DMC1 and H3K4me3 
signals on the B6 chromosome for the humanized allele in the humanized 
rescue mouse. Hotspots are clustered according to the fraction of their 
H3K4me3 signal that is on the B6 chromosome (r), and the ratio of the 
mean DMC1 and H3K4me3 signals in each class is shown here. The 
whiskers show 95% confidence intervals for the mean, estimated using 
bootstrapping. When r > 0.5, the B6 chromosome has greater H3K4me3 
than the PWD chromosome, and vice versa. The ratio could not be 
estimated for r < 0.01 owing to H3K4me3 levels being zero or nearly zero 
in those cases. f, Left, ratio of mean DMC1 and H3K4me3 signals on 
the B6 chromosome compared with the H3K4me3 signal on the PWD 
chromosome (log scale) in the infertile mouse. Asymmetric hotspots were 
defined as those with H3K4me3 fraction on the B6 chromosome >0.9, 
and symmetric hotspots were those with the fraction between 0.1 and 
0.9. Hotspots that we estimated to be completely asymmetric (H3K4me3 
fraction = 0 on either chromosome) or those with H3K4me3 enrichment 
on either chromosome close to zero (enrichment < 0.05) were excluded 
to avoid singularities on either axis. Asymmetric hotspots were binned 
into four bins of equal size and symmetric hotspots were binned into ten 
bins of equal size. Different numbers of bins were used for asymmetric 
and symmetric hotspots to get approximately similar confidence intervals 
(error bars represent 95% confidence intervals) to enable comparison. We 
did not observe many weak symmetric hotspots as we have limited power 
to detect such hotspots, which is why there are no symmetric bins with 
very low H3K4me3 levels on the homologue (right). Right, as left, but with 
the ratio determined for the PWD chromosome relative to H3K4me3 on 
the B6 chromosome. Accordingly, asymmetric hotspots are defined as 
those with H3K4me3 fraction on the PWD chromosome >0.9. 
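
The two P values quoted in panel c (14 of 16 and 9 of 16 hotspots above a median line) are numerically consistent with a two-sided exact binomial (sign) test against a probability of 0.5. That the authors used this particular test is an assumption; the sketch below only shows that such a test reproduces the reported values, and the function name is hypothetical.

```python
from math import comb


def two_sided_sign_test(k, n):
    """Two-sided exact binomial P value for k successes in n trials, p = 0.5."""
    # For p = 0.5 the distribution is symmetric, so double the upper tail
    # taken from the more extreme of k and n - k.
    extreme = max(k, n - k)
    tail = sum(comb(n, i) for i in range(extreme, n + 1)) / 2 ** n
    return min(1.0, 2 * tail)


print(f"{two_sided_sign_test(14, 16):.3f}")  # 0.004
print(f"{two_sided_sign_test(9, 16):.2f}")   # 0.80
```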
Extended Data Figure 10 | Elevation of DMC1 asymmetric heat is not 
explained by GC content, local heterozygosity, differences in binding- 
motif-disrupting mutations or by outliers. a-f, Comparison of DMC1 
signals in the infertile (PWD × B6)F1^PWD/B6 mouse, at symmetric and 
asymmetric hotspots respectively, binned by H3K4me3 enrichment, after 
matching symmetric and asymmetric hotspots on various features: 
a, DMC1 heat in B6^B6/B6; b, local heterozygosity outside the PRDM9 binding 
motif, in a 500-bp window; c, as b, but for a 1-kb window; d, number 
of SNPs in binding motif; e, number of indels in binding motif; f, local 
GC content, computed in a 200-bp window around hotspot centre. 
g, Distributions of the ratios of H3K4me3 heats on the B6 chromosome, 
in the rescue (PWD × B6)F1^PWD/H versus humanized B6^H/H mice, for the 
symmetric (fraction of informative DMC1 reads in the range 0.4-0.6, 
green) and asymmetric (fraction 0.9-1, orange) hotspots under humanized 
PRDM9 control shared between the two mice. The distributions are very 
close, suggesting similar trimethylation by PRDM9 on the B6 chromosome 
in both mice. h, As g, but for the DMC1 heats. Despite similar 
trimethylation marking by PRDM9 in both mice, we observed notable 
changes in the distribution of DMC1 ratios. This could be due to either 
more breaks occurring at the asymmetric sites, or a longer time taken to 
repair them. i, Quantile-quantile plots of DMC1 heats for hotspots under 
the control of the humanized allele on the B6 chromosome in the rescue 
(PWD × B6)F1^PWD/H (y axis, left) versus the humanized B6^H/H (x axis) 
mice, for symmetric (green) and asymmetric (orange) hotspots. Dotted 
line represents the ratios of asymmetric to symmetric quantiles (excluding 
distribution tails; y axis, right). Dashed line represents expected ratio if 
there were no differences between symmetric and asymmetric hotspots. 
The observed ratio of DMC1 quantiles is constant across DMC1 heats, 
emphasizing that the increase in DMC1 heat at asymmetric sites is very 
similar across the whole range of DMC1 heats, and does not simply result 
from a few outlying hotspots. 
Schizophrenia risk from complex 
variation of complement component 4 

Robert E. Handsaker, Schizophrenia Working Group of the Psychiatric Genomics Consortium, Mark J. Daly, 
Michael C. Carroll, Beth Stevens & Steven A. McCarroll 

Schizophrenia is a heritable brain illness with unknown pathogenic mechanisms. Schizophrenia’s strongest genetic 
association at a population level involves variation in the major histocompatibility complex (MHC) locus, but the genes 
and molecular mechanisms accounting for this have been challenging to identify. Here we show that this association 
arises in part from many structurally diverse alleles of the complement component 4 (C4) genes. We found that these 
alleles generated widely varying levels of C4A and C4B expression in the brain, with each common C4 allele associating 
with schizophrenia in proportion to its tendency to generate greater expression of C4A. Human C4 protein localized 
to neuronal synapses, dendrites, axons, and cell bodies. In mice, C4 mediated synapse elimination during postnatal 
development. These results implicate excessive complement activity in the development of schizophrenia and may help 
explain the reduced numbers of synapses in the brains of individuals with schizophrenia. 


Schizophrenia is a heritable psychiatric disorder involving impair- 
ments in cognition, perception, and motivation that usually manifest 
late in adolescence or early in adulthood. The pathogenic mechanisms 
underlying schizophrenia are unknown, but observers have repeatedly 
noted pathological features involving excessive loss of grey matter!”, 
and reduced numbers of synaptic structures on neurons* >. Although 
treatments exist for the psychotic symptoms of schizophrenia, there is 
no mechanistic understanding of, nor effective therapies to prevent or 
treat, the cognitive impairments and deficit symptoms of schizophrenia, 
which are the earliest and most constant features of the disorder. An 
important goal in human genetics is to find the biological processes 
that underlie such disorders. 

More than 100 loci in the human genome contain single nucleotide 
polymorphism (SNP) haplotypes that associate with risk of schizo- 
phrenia®; however, the functional alleles and mechanisms at these loci 
remain to be discovered. By far the strongest such genetic relationship 
is schizophrenia’s association with genetic markers across the major his- 
tocompatibility complex (MHC) locus, which spans several megabases 
(Mb) of chromosome 6 (refs 6-10). The MHC locus is best known for its 
role in immunity, containing 18 highly polymorphic human leukocyte 
antigen (HLA) genes that encode a vast suite of antigen-presenting mol- 
ecules. In some autoimmune diseases, genetic associations at the MHC 
locus arise from alleles of HLA genes!) !?; however, schizophrenia’s 
association to the MHC has not yet been explained. 

Though the functional alleles that give rise to genetic associations 
have in general been challenging to find, the schizophrenia-MHC 
association has been particularly challenging because schizophre- 
nia’s complex pattern of association to markers in the MHC locus 
spans hundreds of genes and does not correspond to the linkage 
disequilibrium (LD) around any known variant®!”. This prompted 
us to consider cryptic genetic influences that might generate unconventional 
genetic signals. The most strongly associated markers in 
several large case/control cohorts were near a complex, multi-allelic, 
and only partially characterized form of genome variation that affects 
the C4 gene encoding complement component 4 (Extended Data 
Fig. 1). The association of schizophrenia to CSMD1 (refs 6, 10), 
which encodes a regulator of C4 (ref. 13), further motivated us to 
consider C4. 


C4 structures and MHC SNP haplotypes 

Human C4 exists as two functionally distinct genes (isotypes), C4A and 
C4B; both vary in structure and copy number. One to three C4 genes 
(C4A and/or C4B) are commonly present as a tandem array within 
the MHC class III region (Fig. 1a and Extended Data Fig. 1g)!*"1*. 
The protein products of C4A and C4B bind different molecular 
targets’®*°. C4A and C4B segregate in both long (L) and short (S) 
genomic forms (C4AL, C4AS, C4BL and C4BS), distinguished by the 
presence or absence (in intron 9) of a human endogenous retroviral 
(HERV) insertion that lengthens C4 from 14 to 21 kb without changing 
the C4 protein sequence’ (Fig. 1b). 

We developed a way (Extended Data Fig. 2) to identify the ‘struc- 
tural haplotypes’ of C4—the copy number of C4A and C4B and the 
long/short (HERV) status of each C4A and C4B copy—present on 222 
copies of human chromosome 6. Using droplet digital PCR (ddPCR), 
we found that genomes contained 0-5 C4A genes, 0-3 C4B genes, 
1-5 long (L) C4 genes, and 0-3 short (S) C4 genes (Extended Data 
Fig. 2a, b). We also developed assays to determine the long/short 
(HERV) status of each C4A and C4B gene copy (Extended Data Fig. 2c), 
thus revealing copy number of C4AL, C4BL, C4AS, and C4BS in each 
genome (Supplementary Methods). 
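The copy-number measurements above lend themselves to a simple consistency check. The legend of Extended Data Fig. 2 mentions a checksum relationship (A+B=L+S): every C4 copy is either C4A or C4B and also either long or short. The following is a minimal sketch of that check, not the authors' code; the class name, field names, and example genomes are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class C4CopyNumbers:
    """Integer copy numbers of C4 gene types in one diploid genome."""

    c4a: int    # copies of the C4A isotype
    c4b: int    # copies of the C4B isotype
    long: int   # copies carrying the HERV insertion (L)
    short: int  # copies lacking the HERV insertion (S)

    def passes_checksum(self) -> bool:
        """Each copy is A or B, and L or S, so A + B must equal L + S."""
        return self.c4a + self.c4b == self.long + self.short

    def in_reported_range(self) -> bool:
        """Ranges reported in the text: 0-5 C4A, 0-3 C4B, 1-5 long, 0-3 short."""
        return (0 <= self.c4a <= 5 and 0 <= self.c4b <= 3
                and 1 <= self.long <= 5 and 0 <= self.short <= 3)


# Hypothetical genomes, for illustration only.
consistent = C4CopyNumbers(c4a=2, c4b=2, long=3, short=1)
inconsistent = C4CopyNumbers(c4a=2, c4b=1, long=3, short=1)
print(consistent.passes_checksum(), inconsistent.passes_checksum())
```

A genome that fails the checksum would point to a measurement error in at least one of the four assays rather than to a real biological state.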

We analysed inheritance in father-mother-offspring trios (Extended
Data Fig. 2d) to identify the C4A and C4B contents of individual alleles
(Extended Data Fig. 2e). We found that four common C4 structural
haplotypes (AL-BL, AL-BS, AL-AL, and BS) were collectively present
on 90% of the 222 independent chromosomes sampled; 11 uncommon
C4 haplotypes comprised the other 10% (Fig. 1c).

Figure 1 | Structural variation of the complement component 4 (C4)
gene. a, Location of the C4 genes within the major histocompatibility
complex (MHC) locus on human chromosome 6. b, Human C4 exists
as two paralogous genes (isotypes), C4A and C4B; the encoded proteins
are distinguished at a key site that determines which molecular targets
they bind!””°. Both C4A and C4B also exist in both long (L) and short (S)
forms distinguished by an endogenous retroviral (C4-HERV) sequence in
intron 9. c, Structural forms of the C4 locus and their frequencies among
a European-ancestry population sample (222 chromosomes from 111
genetically unrelated individuals, HapMap CEU), inferred as described in
Extended Data Fig. 2. Asterisks indicate allele frequencies too low to be
estimated accurately.

The series of many SNP alleles along a genomic segment (the SNP 
haplotype) can be used to identify chromosomal segments that come 
from shared common ancestors. We identified the SNP haplotype(s) 
on which each C4 locus structure was present (Fig. 2). The three most 
common C4 locus structures were each present on multiple MHC SNP 
haplotypes (Fig. 2). For example, the C4 AL-BS structure (frequency 
31%) was present on five common haplotypes (frequencies 4%, 4%, 
4%, 8%, and 6%) and many rare haplotypes (collective frequency 5%, 
Fig. 2). Reflecting this haplotype diversity, each of these C4 structures 
exhibited real but only partial correlation to individual SNPs (Extended 
Data Fig. 3). The relationship between C4 structures and SNP 
haplotypes was generally one-to-many: a C4 structure might be present 
on many haplotypes, but a given SNP haplotype tended to have one 
characteristic C4 structure (Fig. 2). 
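The one-to-many relationship described above can be pictured as a lookup table in which each SNP haplotype maps to a single C4 structure, while one structure may be reached from several haplotypes. The sketch below uses the AL-BS frequencies quoted in the text (five common haplotypes at 4%, 4%, 4%, 8%, and 6%, plus rare haplotypes at 5% collectively); the haplotype labels are hypothetical placeholders, and the assignment of each frequency to a label is arbitrary.

```python
from collections import defaultdict

# SNP haplotype -> (C4 structure, frequency in percent).
# Labels are placeholders; frequencies are those quoted for AL-BS.
haplotype_to_structure = {
    "hap_1": ("AL-BS", 4),
    "hap_2": ("AL-BS", 4),
    "hap_3": ("AL-BS", 4),
    "hap_4": ("AL-BS", 8),
    "hap_5": ("AL-BS", 6),
    "rare_haplotypes": ("AL-BS", 5),
}

# Invert the mapping: one structure collects many haplotypes.
by_structure = defaultdict(list)
for haplotype, (structure, freq) in haplotype_to_structure.items():
    by_structure[structure].append((haplotype, freq))

# The haplotype frequencies should add up to the structure's frequency.
total_al_bs = sum(freq for _, freq in by_structure["AL-BS"])
print(total_al_bs)
```

The haplotype frequencies add up to the 31% reported for the AL-BS structure, which is the sense in which no single SNP haplotype stands in for the structure as a whole.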


C4 expression variation in the brain 

As C4A and C4B vary in both copy number and C4-HERV status 
(Fig. 1), and because other HERVs can function as enhancers*!~?3, 
C4 variation might affect expression of C4 genes. We assessed how 
C4 structural variation related to RNA expression of C4A and C4B in 
eight panels of post-mortem human adult brain samples (674 samples
from 245 distinct donors in 3 cohorts, Supplementary Methods).
The results of this expression analysis were consistent across all
five brain regions analysed. First, RNA expression of C4A and C4B 
increased proportionally with copy number of C4A and C4B respec- 
tively (Fig. 3a, b and Extended Data Fig. 4). These observations mirror 
earlier observations in human serum”. Second, expression levels of 
C4A were two to three times greater than expression levels of C4B, 
even after controlling for relative copy number in each genome 
(Fig. 3c). Third, copy number of the C4-HERV sequence increased the 
ratio of C4A to C4B expression (P< 107’, P< 10-*, P< 107°, respec- 
tively, in the three cohorts examined, by Spearman rank correlation) 
(Fig. 3c and Extended Data Fig. 4). 

We used the above data to create genetic predictors of C4A and 
C4B expression levels in the brain (Supplementary Methods). If C4A 
or C4B expression levels influence a phenotype, then the aggregate 
genetic predictor might associate to schizophrenia more strongly than 
individual variants do. 
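As a rough illustration of what such a genetic predictor looks like, the legend of Extended Data Fig. 4 describes a linear model of expression as a function of the numbers of copies of AL, BL, AS, and BS. The sketch below shows only the form of such a model. The weights are arbitrary placeholders, not the regression coefficients from the Supplementary Methods, and the function name is hypothetical.

```python
from typing import Mapping

# Placeholder per-copy weights; NOT the fitted coefficients from the paper.
ILLUSTRATIVE_C4A_WEIGHTS = {"AL": 1.0, "AS": 0.8, "BL": 0.3, "BS": 0.0}


def predict_c4a_expression(
    copies: Mapping[str, int],
    weights: Mapping[str, float] = ILLUSTRATIVE_C4A_WEIGHTS,
) -> float:
    """Linear predictor: sum over gene types of (weight x copy number)."""
    unknown = set(copies) - set(weights)
    if unknown:
        raise ValueError(f"unknown C4 gene types: {sorted(unknown)}")
    return sum(weights[gene_type] * n for gene_type, n in copies.items())


# Hypothetical diploid genome carrying two AL copies, one BL and one BS.
genome = {"AL": 2, "BL": 1, "BS": 1}
predicted = predict_c4a_expression(genome)
print(predicted)
```

Because the predictor aggregates over every structural allele a person carries, it can track a phenotype more closely than any single allele or SNP, which is the rationale given in the paragraph above.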


C4 structural variation in schizophrenia 

Schizophrenia cases and controls from 22 countries have been analysed 
genome-wide for SNPs, implicating the MHC locus as the strongest of 
more than 100 genome-wide-significant associations®. Our analysis 
indicated that long haplotypes defined by many SNPs carry character- 
istic C4 alleles (Fig. 2), potentially making it possible to infer C4 alleles 
by statistical imputation” from combinations of many SNPs. We used 
our 222 integrated haplotypes of MHC SNPs and C4 alleles (Fig. 2) as 
reference chromosomes for imputation. We found that the four most 
common structural forms of the C4A/C4B locus (BS, AL-BS, AL-BL, 
and AL-AL) could be inferred with reasonably high accuracy (generally 
0.70 < r² < 1.00, Supplementary Table 2).
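The r² quoted here is the usual imputation-accuracy measure: the squared correlation between imputed allele dosages and directly measured copy numbers. A minimal sketch of that calculation follows; the example dosages are invented for illustration and are not data from the study.

```python
from math import fsum
from typing import Sequence


def squared_correlation(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x, mean_y = fsum(x) / n, fsum(y) / n
    sxy = fsum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = fsum((a - mean_x) ** 2 for a in x)
    syy = fsum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical data: measured copies (0, 1 or 2) of one C4 structure per
# individual, and the dosages an imputation method might return.
measured = [0, 1, 2, 1, 0, 2]
imputed = [0.1, 0.9, 1.8, 1.2, 0.0, 2.0]
r2 = squared_correlation(measured, imputed)
print(round(r2, 3))
```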

We then analysed SNP data from 28,799 schizophrenia cases and 
35,986 controls, from 40 cohorts in 22 countries contributing to the 
Psychiatric Genomics Consortium (PGC)°. We evaluated association 
to 7,751 SNPs across the extended MHC locus (chr6: 25-34 Mb), to 
C4 structural alleles (Fig. 1c), and to HLA sequence polymorphisms 
imputed from the SNP data. We also predicted levels of C4A and C4B 
expression from the imputed C4 structural alleles. 

The association of schizophrenia to these genetic variants exhibited 
two prominent features (Fig. 4a, b). One feature involved a large set 
of similarly associating SNPs spanning 2 Mb across the distal end of 
the extended MHC region (we use this set’s most strongly associating 
SNP, rs13194504, as its genetic proxy). The other peak of association 
centred at C4, where schizophrenia associated most strongly with the 
genetic predictor of C4A expression levels (P = 3.6 × 10⁻²⁴) (Fig. 4a and
Extended Data Fig. 5). In the region near C4 (chromosome 6, 31-33 Mb), 
the more strongly a SNP correlated with predicted C4A expression, the 
more strongly it associated with schizophrenia (Fig. 4b, bottom panel). 

Although the variation at C4 and in the distal extended MHC region 
associated with schizophrenia with similar strengths (P = 3.6 × 10⁻²⁴
and 5.5 × 10⁻²⁸, respectively), their correlation with each other was low
(r² = 0.18, Fig. 4b), suggesting that they reflect distinct genetic
influences. Conditional analysis confirmed this: in analyses controlling
for either rs13194504 or genetically predicted C4A expression, the 
other genetic variable still defined a genome-wide-significant
association peak (P = 7.8 × 10⁻¹⁹ and 8.0 × 10⁻¹⁴, respectively, Fig. 4c, d).
Controlling for both genetic variables revealed a third association sig- 
nal just proximal to the MHC locus (Fig. 4e) involving SNPs around 
BAK1 and SYNGAP1, the latter of which encodes a major compo- 
nent of the postsynaptic density; de novo loss-of-function muta- 
tions in SYNGAP1 associate with autism. In joint analysis, all three 
genetic signals remained significant (P = 8.0 × 10⁻¹⁴, 2.8 × 10⁻⁸, and
1.7 × 10⁻⁸, respectively) and no additional genome-wide-significant
signals remained in the MHC locus (Fig. 4f). 
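The association plots in Fig. 4 show each P value as −log₁₀(P) and compare it with the genome-wide significance threshold of P = 5 × 10⁻⁸ given in the figure legend. The sketch below applies that transformation to three unconditioned P values taken from the association table later in this section; the function names are hypothetical.

```python
import math

GENOME_WIDE_P = 5e-8  # threshold quoted in the Fig. 4 legend


def neg_log10(p: float) -> float:
    """Height of a point on the association plot."""
    if not 0.0 < p <= 1.0:
        raise ValueError("P value must lie in (0, 1]")
    return -math.log10(p)


def is_genome_wide_significant(p: float) -> bool:
    return p < GENOME_WIDE_P


# Unconditioned P values as listed in the association table.
p_values = {
    "predicted C4A expression": 3.60e-24,
    "predicted C4B expression": 2.31e-07,
    "AL-BS structure": 0.03,
}

for name, p in p_values.items():
    print(f"{name}: -log10(P) = {neg_log10(p):.2f}, "
          f"genome-wide significant = {is_genome_wide_significant(p)}")
```

On this scale the threshold sits at about 7.3, so the C4A predictor clears it by a wide margin while the C4B predictor falls just short.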

In some autoimmune diseases with genetic associations in the
MHC locus, alleles of HLA genes associate more strongly than
do other variants in the MHC locus, appearing to explain the
associations!!!*. In contrast, in schizophrenia, classical HLA alleles
associated with schizophrenia less strongly than other genetic variants
in the MHC region did (Extended Data Fig. 6). We further considered
the strongest schizophrenia associations to classical HLA alleles at
distinct loci (involving HLA-B*0801, HLA-DRB1*0301, and HLA-DQB1*02);
conditional analysis indicated that each could be explained by LD
to the stronger signals at C4 and rs13194504 (Extended Data Fig. 7).

If each C4 allele affects schizophrenia risk via its effect on C4A
expression, then this relationship should be visible across specific
C4 alleles. We measured schizophrenia risk levels for the common
C4 structural alleles (BS, AL-BS, AL-BL, and AL-AL); these alleles
showed relative risks ranging from 1.00 to 1.27 (Fig. 5a). We also
estimated (from the post-mortem brain samples) the C4A expression
levels generated by these four alleles (Fig. 5b). Schizophrenia risk and
C4A expression levels yielded the same ordering of the C4 allelic series
(Fig. 5a, b).

Figure 2 | Haplotypes formed by C4 structures and SNPs. SNP
haplotype(s) on which common C4 structures were present. Each thin
horizontal line represents the series of SNP alleles (haplotype) along a
250 kilobase (kb) chromosomal segment. Each column represents a SNP;
grey and black indicate which allele is present on each haplotype. The SNP
haplotypes are grouped into 13 sets of haplotypes associating with each of
the four most common C4 structures. Three C4 structures (AL-BS,
AL-BL, and AL-AL) each segregated on multiple SNP haplotypes
(numbered at right).

Figure 3 | Brain RNA expression of C4A and C4B in relation to copy
numbers of C4A, C4B, and the C4-HERV. a, b, mRNA expression of
C4A (a) and C4B (b) was measured (by ddPCR) in brain tissue from 244
individuals. The copy numbers of C4A, C4B, and the C4-HERV were
measured (by ddPCR) in genomic DNA from the brain donors. The
results were consistent across 8 panels of brain tissue representing
5 brain regions and 3 distinct sets of donors (one set shown here, with
data from 101 individuals; all panels in Extended Data Fig. 4; a few outlier
points are beyond the range of these plots but are shown in Extended
Data Fig. 4). P values were obtained by a Spearman rank correlation test.
In c, expression of C4A (per genomic copy) is normalized to expression of
C4B (per genomic copy) to control for trans-acting influences shared by
C4A and C4B.


We sought an even more stringent test. If this allelic series of relation- 
ships with schizophrenia risk (Fig. 5a) arises from C4 locus structure— 
rather than from other genetic variation in the MHC locus—then a 
given C4 structure should exhibit the same schizophrenia risk regard- 
less of the MHC haplotype on which it appears. We measured the schiz- 
ophrenia association of all 13 common combinations of C4 structure 
and MHC SNP haplotype (Fig. 5c). Across this allelic series, each C4 
allele exhibited a characteristic level of schizophrenia risk, regardless 
of the haplotype on which it was present (Fig. 5c). 
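One informal way to look at this claim is to group the per-haplotype effect sizes by C4 structure and compare the groups. The sketch below uses the unconditioned β values listed for each combination of C4 structure and SNP haplotype in the association table later in this section. It takes unweighted means that ignore the standard errors, so it is only an illustration and not the heterogeneity test reported in Fig. 5c.

```python
# Unconditioned effect sizes (beta) per structure/haplotype combination,
# copied from the association table; "other" rows are included.
HAPLOTYPE_BETAS = {
    "AL-BS": [-0.027, -0.031, -0.028, 0.053, -0.064, -0.023],
    "AL-BL": [0.036, 0.042, 0.032, 0.029],
    "AL-AL": [0.074, 0.097],
}

# Unweighted mean effect per C4 structure (ignores SE; illustration only).
mean_beta = {
    structure: sum(betas) / len(betas)
    for structure, betas in HAPLOTYPE_BETAS.items()
}

# Structures ordered from lowest to highest mean effect on risk.
ordering = sorted(mean_beta, key=mean_beta.get)
print(ordering)
for structure in ordering:
    print(f"{structure}: mean beta = {mean_beta[structure]:+.4f}")
```

The ordering of the group means matches the allelic series described above, although individual haplotypes (for example the AL-BS row with β = 0.053) can still depart from their group.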


C4A RNA expression in schizophrenia 

These genetic findings (Fig. 5a, c) predict that C4A expression might be 
elevated in brain tissue from schizophrenia patients. We measured C4A 
RNA expression levels in brain tissue from 35 schizophrenia patients 
and 70 individuals without schizophrenia. The median expression of 
C4A in brain tissues from schizophrenia patients was 1.4-fold greater 
(P=2 x 10~° by Mann-Whitney U-test; Fig. 5d) and was elevated in 
each of the five brain regions assayed (Extended Data Fig. 8). This 
relationship did not meaningfully change in analyses adjusted for age 
or post-mortem interval. The relationship remained significant after 
correcting for the higher average C4A copy number among the brain 
donors affected with schizophrenia (1.3-fold greater, P=0.002). Some 
earlier studies have also reported elevated levels of complement pro- 
teins in serum of schizophrenia patients”””*. 
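The group comparison above relies on the Mann-Whitney U-test, which asks how often a value from one group exceeds a value from the other without assuming any particular distribution. The sketch below computes only the U statistic, by direct pair counting; it does not compute a P value, and the expression values are invented for illustration.

```python
from typing import Sequence


def mann_whitney_u(x: Sequence[float], y: Sequence[float]) -> float:
    """U statistic for x: pairs (x_i, y_j) with x_i > y_j; ties count 0.5."""
    u = 0.0
    for a in x:
        for b in y:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


# Hypothetical normalized C4A expression values (not data from the study).
cases = [1.4, 2.0, 1.1, 1.8]
controls = [1.0, 0.9, 1.2, 1.1, 0.8]

u_cases = mann_whitney_u(cases, controls)
u_controls = mann_whitney_u(controls, cases)
print(u_cases, u_controls, len(cases) * len(controls))
```

The two U values always sum to the number of case-control pairs, so a U close to that maximum means that values in the first group are nearly always the larger ones.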


C4 in the central nervous system 

C4 is a critical component of the classical complement cascade, an 
innate immune system pathway that rapidly recognizes and eliminates 
pathogens and cellular debris. In the brain, other genes in the classical 
complement cascade have been implicated in the elimination or ‘prun- 
ing’ of synapses’). 

To evaluate the distribution of C4 in human brain, we performed
immunohistochemistry on sections of the prefrontal cortex and
hippocampus. We observed C4⁺ cells in the grey and white matter,
with the greatest number of C4⁺ cells detected in the hippocampus.
Co-staining with cell-type-specific markers revealed C4 in subsets
of NeuN⁺ neurons (Fig. 6a; antibody specificity further evaluated in
Extended Data Fig. 9a) and a subset of astrocytes. Much of the C4
immunoreactivity was punctate (Fig. 6b), co-localizing with synaptic
puncta identified by co-immunostaining for the pre- and postsynaptic
markers VGLUT1/2 (also known as SLC17A7 and SLC17A6, respectively)
and PSD-95 (also known as DLG4) (Fig. 6b). These results suggest
that C4 is produced by, or deposited on, neurons and synapses.

Figure 4 | Association of schizophrenia to C4 and the extended MHC
locus. Association of schizophrenia to 7,751 SNPs across the MHC locus
and to genetically predicted expression levels of C4A and C4B in the brain
(represented in the genomic location of the C4 gene). The data shown are
based on analysis of 28,799 schizophrenia cases and 35,986 controls of
European ancestry from the Psychiatric Genomics Consortium. The height
of each point represents the statistical strength (−log₁₀(P)) of association
with schizophrenia. a, b, Association of schizophrenia to SNPs in the
MHC locus and to genetically predicted expression of C4A and C4B. In b,
genetic variants are coloured by their levels of correlation to rs13194504
(upper panel) or by their levels of correlation to genetically predicted brain
C4A expression levels (lower panel). c-f, Conditional association analysis.
The red dashed line indicates the statistical threshold for genome-wide
significance (P = 5 × 10⁻⁸). See also Extended Data Figs 5-7 for detailed
association analyses involving C4 locus structures and HLA alleles.

To further characterize neuronal C4, we cultured human primary 
cortical neurons and evaluated C4 expression, localization, and 
secretion. Neurons expressed C4 mRNA and secreted C4 protein 
(Extended Data Fig. 9c). Neurons exhibited C4-immunoreactive 
puncta along their processes and cell bodies (Fig. 6c, d; antibody 
specificity further evaluated in Extended Data Fig. 9b). About 75% 
of C4 immunoreactivity localized to neuronal processes (Fig. 6c); 
of the C4 in neuronal processes, approximately 65% was observed 
in dendrites (MAP2⁺, neurofilament⁺ processes) and 35% in axons
(MAP2⁻, neurofilament⁺ processes). Punctate C4 immunoreactivity
was observed at 48% of structural synapses as defined by co-localized 
synaptotagmin and PSD-95 (Fig. 6d). 

The association of increased C4 with schizophrenia (Figs 4 and 5),
the presence of C4 at synapses (Fig. 6b, d), the involvement of other
complement proteins in synapse elimination’”*', and earlier reports of
decreased synapse numbers in schizophrenia patients*°, together
suggested that C4 might work with other components of the classical
complement cascade to promote synaptic pruning. To test this hypothesis,
we moved to a mouse model. C4A and C4B appear to have functionally
specialized outside the rodent lineage, but the mouse genome contains
a C4 gene that shares features with both C4A and C4B (Extended Data
Fig. 10a, b). Impairments in schizophrenia tend to affect higher cognitive
functions and recently expanded brain regions for which analogies
in mice are uncertain*”. However, waves of postnatal synapse elimination
occur in many brain regions, and strong experimental models
have been established in several mammalian visual systems. In these
systems, synaptic projections from retinal ganglion cells (RGCs) onto
thalamic relay neurons within the dorsal lateral geniculate nucleus
(dLGN) of the visual thalamus undergo activity-dependent synaptic
refinement””-3433-*5. We found that C4 RNA was expressed in the LGN
and in RGCs purified from the retina during the period of synaptic
pruning (Extended Data Fig. 10c).

Figure 5 | C4 structures, C4A expression, and schizophrenia risk. a,
Schizophrenia risk associated with four common structural forms of C4
in analysis of 28,799 schizophrenia cases and 35,986 controls. b, Brain
C4A RNA expression levels associated with four common structural
forms of C4. β was calculated from fitting C4A RNA expression (in
brain tissue) to the number of chromosomes (0, 1, or 2) carrying each
C4 structure (across 120 individuals sampled). c, Schizophrenia risk
associated with 13 combinations of C4 structural allele and MHC SNP
haplotype. The numbers on the y axis adjacent to the C4 structures
indicate the ‘haplogroup’, the MHC SNP haplotype background on which
the C4 structure segregates, and correspond to Fig. 2. Statistical tests of
heterogeneity yielded P = 0.55 for AL-AL alleles; P = 0.93 for AL-BL
alleles; P = 0.06 for AL-BS alleles; and P=5.7 x 10~° across the overall
allelic series. d, Expression levels of C4A RNA were directly measured
(by RT-ddPCR) in post-mortem brain samples from 35 schizophrenia
patients and 70 individuals not affected with schizophrenia. Measurements
for all five brain regions analysed exhibited the same relationship
(Extended Data Fig. 8). Horizontal lines show the median value for
each group. P values were derived by a (non-parametric) one-sided
Mann-Whitney test. Error bars shown in a-c represent 95% confidence
intervals around the effect size estimate.

In the immune system, C4 promotes C3 activation, allowing C3 to 
covalently attach onto its targets and promote their engulfment by phago- 
cytic cells. In the developing mouse brain, C3 targets subsets of synapses 
and is required for synapse elimination by microglia, the principal CNS 
cells expressing receptors for complement”””. We found that in mice 
deficient in C4 (ref. 36), C3 immunostaining in the dLGN was greatly 
reduced compared to wild-type littermates (Fig. 7a, b), with fewer syn- 
aptic inputs being C3-positive in the absence of C4 (Fig. 7c). These data 
demonstrate a role for C4 in complement deposition on synaptic inputs. 

We then evaluated whether mice deficient in C4 had defects in
synaptic remodelling, as has been described for C3-deficient mice”’.
Mice lacking functional C4 exhibited greater overlap between RGC
inputs from the two eyes (P < 0.001) than wild-type littermate controls,
suggesting reduced synaptic pruning (Fig. 7d, Extended Data Fig.
10d, e, and Supplementary Methods). The degree of deficit in
C4−/− mice was similar to that previously reported for C1q and
C3−/− mice”?*!. Heterozygous C4+/− mice, with one wild-type copy of
C4, had an intermediate phenotype (Fig. 7d). These data provide direct
evidence that C4 mediates synaptic refinement in the developing brain.

Figure 6 | C4 protein at neuronal cell bodies,
processes and synapses. a, C4 protein
localization in human brain tissue. Two
representative confocal images (drawn from
immunohistochemistry performed on samples
from five individuals with schizophrenia
and two unaffected individuals) within
the hippocampal formation demonstrate
localization of C4 in a subset of NeuN⁺ neurons.
b, High-resolution structured illumination
microscopy (SIM) imaging of tissue in the
hippocampal formation reveals colocalization
of C4 with the presynaptic terminal markers
VGLUT1/2 and the postsynaptic marker
PSD-95. c, Confocal images of primary human
cortical neurons show colocalization of C4,
MAP2, and neurofilament along neuronal
processes. d, Confocal image of primary cortical
neurons stained for C4, presynaptic marker
synaptotagmin, and postsynaptic marker PSD-95.
Scale bars, 25 μm (a, c, and d); 5 μm (b, left);
and 1 μm (b, right). Extended Data Fig. 9
contains additional data on antibody specificity.

Figure 7 | C4 in retinogeniculate synaptic refinement. a, Representative
confocal images of immunohistochemistry for C3 in the P5 dLGN
showed reduced C3 deposition in the dLGN of C4−/− mice compared
to wild-type (WT) littermates. b, Quantification confirmed reduced
C3 immunoreactivity in the dLGN (n = 3 mice per group, P < 0.05,
t-test; y axis: mean fluorescence intensity, normalized to wild type).
c, Co-localization analysis revealed a reduction in the fraction of
VGLUT2⁺ puncta that were C3⁺ in C4-deficient mice relative to their
WT littermates (n = 3 mice per group, P = 0.0011, two-sided t-test).
d, Synaptic refinement in mice with 0, 1, or 2 copies of C4. These images
represent the segregation of ipsilateral and contralateral RGC projections
to the dLGN; two analysis methods were used. Top, projections from
the ipsilateral (green) and contralateral (red) eyes show minimal overlap
(yellow) in wild-type mice. The overlapping area is significantly increased
in C4−/− mice (n = 6 mice per group, P < 0.01, ANOVA with Bonferroni
post-hoc tests). Bottom, threshold-independent analysis using the R
value*? (R = log₁₀(F_ipsi/F_contra)). Pixels are pseudocoloured with an R value
heat map (red indicates areas having only contralateral inputs; purple, only
ipsilateral inputs). Compared to their wild-type littermates, C4-deficient
mice exhibited lower R value variance, indicating defects in synaptic
refinement (n = 6 mice per group, P < 0.001, ANOVA with Bonferroni
post-hoc tests). Control experiments analysing total dLGN size, dLGN
area receiving ipsilateral input, and number of RGCs are shown in
Extended Data Fig. 10f-h, respectively. Error bars in b-d represent s.e.m.
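The threshold-independent analysis in the Fig. 7 legend computes, for each pixel, R = log₁₀(F_ipsi/F_contra) and then compares the variance of R between genotypes. The sketch below illustrates why lower variance indicates poorer segregation. The pixel intensities are invented, the function name is hypothetical, and intensities are assumed to be strictly positive (for example after background handling), which the legend does not specify.

```python
import math
from statistics import pvariance
from typing import List, Sequence


def r_values(f_ipsi: Sequence[float], f_contra: Sequence[float]) -> List[float]:
    """Per-pixel R = log10(F_ipsi / F_contra); intensities must be > 0."""
    if len(f_ipsi) != len(f_contra):
        raise ValueError("intensity arrays must have the same length")
    return [math.log10(i / c) for i, c in zip(f_ipsi, f_contra)]


# Hypothetical pixel intensities. In a well-segregated dLGN each pixel is
# dominated by one eye, so R is strongly positive or strongly negative.
segregated = r_values([100, 90, 1, 2], [1, 2, 100, 90])

# With overlapping inputs both eyes contribute to each pixel and R stays
# close to zero.
overlapping = r_values([50, 40, 30, 45], [45, 50, 40, 30])

var_segregated = pvariance(segregated)
var_overlapping = pvariance(overlapping)
print(var_segregated > var_overlapping)
```

Under these assumptions a narrow distribution of R reflects pixels that receive mixed input from both eyes, which matches the interpretation of reduced R variance as a refinement defect.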
Discussion

We developed ways to analyse a complex form of genome structural 
variation (Figs 1 and 2) and discovered that schizophrenia’s association 
with variation in the MHC locus involves many common, structur- 
ally distinct C4 alleles that affect expression of C4A and C4B in the 
brain; each allele associated with schizophrenia risk in proportion to 
its effect on C4A expression (Figs 3-5). We found that C4 is expressed 
by neurons, localized to dendrites, axons, and synapses, and is secreted 
(Fig. 6). In mice, C4 promoted synapse elimination during the devel- 
opmentally timed maturation of a neuronal circuit (Fig. 7). 

In humans, adolescence and early adulthood bring extensive elim- 
ination of synapses in distributed association regions of cerebral cor- 
tex, such as the prefrontal cortex, that have greatly expanded in recent 
human evolution and appear to become impaired in schizophrenia*”~°. 
Synapse elimination in human association cortex appears to continue 
from adolescence into the third decade of life*’. This late phase of cor- 
tical maturation, which may distinguish humans even from some other 
primates*’, corresponds to the period during which schizophrenia 
most often becomes clinically apparent and patients’ cognitive function 
declines, a temporal correspondence that others have also noted*’. 

The principal pathological findings in brains of those affected with 
schizophrenia involve loss of cortical grey matter without cell death: 
affected individuals exhibit abnormal cortical thinning!” and reduced 
numbers of synaptic structures on cortical pyramidal neurons*. In the 
brain, complement receptors are expressed primarily by microglia, the 
phagocytic immune cells of the central nervous system. The possibility 
that neuron-microglia interactions via the complement cascade con- 
tribute to schizophrenia pathogenesis—for example, that schizophrenia 
arises or intensifies from excessive or inappropriate synaptic pruning 
by microglia during adolescence and early adulthood—would offer a 
potential mechanism for these longstanding observations about age of 
onset and synapse loss. Many other genetic findings in schizophrenia 
involve genes that encode synaptic proteins®*”"*. Diverse synaptic 
abnormalities could in principle interact with the complement system 
and other pathways*>“* to cause excessive stimulation of microglia and 
elimination of synapses during adolescence and early adulthood. 

The two human C4 genes (C4A and C4B) exhibited distinct rela- 
tionships with schizophrenia risk, with increased risk associating most 
strongly with variation that increases expression of C4A. Human C4A and 
C4B proteins, whose functional specialization appears to be evolutionarily 
recent (Extended Data Fig. 10a), show striking biochemical differences: 
C4A more readily forms amide bonds with proteins, while C4B favours 
binding to carbohydrate surfaces’, differences with an established basis 
in C4 protein sequence and structure*”’. An intriguing possibility is that 
C4A and C4B differ in affinity for an unknown binding site at synapses. 

To date, few genome-wide association study (GWAS) associations 
have been explained by specific functional alleles. An unexpected find- 
ing at C4 involves the large number of common, functionally distinct 
forms of the same locus that appear to contribute to schizophrenia risk. 
The human genome contains hundreds of other genes with complex, 
multi-allelic forms of structural variation”. It will be important to learn 
the extent to which such variation contributes to brain diseases and to 
all human phenotypes. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
Extended Data Figure 1 | Association of schizophrenia to common
variants in the MHC locus in individual case-control cohorts, and
schematic of the repeat module containing C4. a–f, Data for several
schizophrenia case-control cohorts that were genome-scanned before
we began this work (a–d) exhibit peaks of association near chr6:
32 Mb (blue vertical line) on the human genome reference sequence
(GRCh37/hg19). Note that association patterns vary from cohort to
cohort, reflecting statistical sampling fluctuations and potentially
fluctuations in allele frequencies of the (unknown) causal variants in
different cohorts. Cohorts such as in b, e and f suggest the existence of
effects at multiple loci within the MHC region. Even in the cohorts with
simpler peaks (a, c, d), the pattern of association across the individual
SNPs at chr6: 32 Mb does not correspond to the LD around any known
variant. This motivated the focus in the current work on cryptic genetic
influences in this region that could cause unconventional association
signals that do not resemble the LD patterns of individual variants.
g, A complex form of genome structural variation resides near chr6:
32 Mb. Shown here are three of the known alternative structural forms
of this genomic region. The most prominent feature of this structural
variation is the tandem duplication of a genomic segment that contains a
C4 gene, 3′ fragments of the STK19 and TNXB genes, and a pseudogenized
copy of the CYP21A2 gene. This cassette is present in 1-3 copies on
the three alleles depicted above; the boundaries below each haplotype
demarcate the sequence that is duplicated. Haplotypes with multiple copies
of this module (middle and bottom) contain multiple functional copies
of C4, whereas the additional gene fragments or copies denoted STK19P,
CYP21A2P, and TNXA are typically pseudogenized. Rare haplotypes with
a gain or loss of intact CYP21A2 have also been observed!®. Although C4A
and C4B contain multiple sequence variants, they are defined based on the
differences encoded by exon 26, which determine the relative affinities of
C4A and C4B for distinct molecular targets'*”° (Fig. 1). Many additional
forms of this locus appear to have arisen by non-allelic homologous
recombination and gene conversion (ref. 18 and Fig. 1).
Extended Data Figure 2 | Schematic of strategy for identifying the
segregating structural forms of the C4 locus. a, Molecular assays for
measuring copy number of the key, variable C4 structural features: the
length polymorphism (HERV insertion) that distinguishes the long (L)
from the short (S) genomic form of C4, and the C4A/C4B isotypic
difference. Each primer-probe-primer assay is represented with the
combination of arrows (primers) and asterisk (probe) in its approximate
genomic location (though not to scale). b, Measurement of copy number
of C4 gene types in the genomes of 162 individuals (from HapMap
CEU sample). The absolute, integer copy number of each C4 gene type
in each genome is precisely inferred from the resulting data. To ensure
high accuracy, the data are further evaluated for a checksum relationship
(A+B=L+S) and for concordance with earlier data from Southern
blotting of 89 of the same HapMap individuals>’. c, To measure the copy
number of compound structural forms of C4 (involving combinations
of L/S and A/B), we perform long-range PCR followed by quantitative
measurement of the A/B isotype-distinguishing sequences in droplets.
d, Analysis of transmissions in father-mother-offspring trios enables
inference of the C4 gene contents of individual copies (alleles) of
chromosome 6. Three example trios are shown in this schematic.
e, Examples of the inferred structural forms of the C4 locus (more shown
in Fig. 1c). For the common C4 structures (AL-BL, AL-BS, AL-AL, and
BS), genomic order of the C4 gene copies is known from earlier assemblies
of sequence contigs in individuals homozygous for MHC haplotypes due
to consanguinity’” and other molecular analyses of the C4 locus'®. For the
rarer C4 structures, the genomic order of C4 gene copies is hypothesized
or provisional.

Schematic panel text: Infer the C4 gene contents of the founder
chromosomes in each trio. Where possible, use existing sequence
data to also infer gene order.
of SNPs in the MHC locus with copy number of C4 gene types (a) and
larger-scale structural forms (haplotypes) (b) of the C4 locus. Dashed,
reflecting the relationship shown in Fig. 2, in which a structural form of
the C4 locus often segregates on multiple different SNP haplotypes.
Extended Data Figure 4 | RNA expression of C4A and C4B in relation
to copy number of C4A, C4B, and the C4-HERV (long form of C4), in
eight panels of post-mortem brain tissue. Copy number of C4 structural
features was measured by ddPCR; RNA expression levels were measured
by RT-ddPCR. a-e, Data for tissues from the Stanley Medical Research
Institute (SMRI) Array Consortium consisting of anterior cingulate cortex (a),
cerebellum (b), corpus callosum (c), orbital frontal cortex (d), and
parietal cortex (e). f, Data for the frontal cortex samples from the NHGRI
Genes and Tissues Expression (GTEx) Project. g, h, Data for tissues from
the SMRI Neuropathology Consortium (anterior cingulate cortex and
cerebellum, respectively). These data were then used to inform (by linear
regression) the derivation of a linear model for predicting each individual’s
RNA expression of C4A and C4B as a function of the numbers of copies
of AL, BL, AS, and BS. The derivation of this model, and the regression
coefficients induced, are described in Supplementary Methods. In the
rightmost plot of each panel, expression of C4A (per genomic copy) is
normalized to expression of C4B (per genomic copy) to more specifically
visualize the effect of the C4-HERV by controlling for genomic copy
number and for any trans-acting influences shared by C4A and C4B; the
inferred regression coefficients (Supplementary Methods) suggest that the
observed effect is mostly due to increased expression of C4A.
Column groups (each reporting P, β, and SE):
(1) Association of SCZ with this variant (unconditioned analysis)
(2) Association of SCZ with this variant (joint analysis with predicted C4A expression as a covariate)
(3) Association of SCZ with genetically predicted C4A expression (joint analysis with this variant as a covariate)

                            (1)                       (2)                   (3)
Variant                     P              β  SE      P          β  SE      P        β      SE

Expression predictors (derived from post mortem brain RNA analysis)

C4A expression (genetic
prediction from imputed C4
locus structure)            3.60E-24   0.247  0.024   NA        NA  NA      NA       NA     NA

C4B expression (genetic
prediction from imputed C4
locus structure)            2.31E-07  -0.092  0.018   0.71   0.008  0.021   2.35E-18 0.253  0.029

Copy number of C4 structural features

C4A genes                   6.31E-19   0.112  0.013   0.45  -0.023  0.030   6.54E-07 0.287  0.058
C4B genes                   1.45E-04  -0.066  0.017   0.63   0.009  0.019   4.58E-21 0.252  0.027
Long C4 genes (with HERV)   7.09E-23   0.084  0.009   0.17   0.029  0.021   4.89E-03 0.170  0.060
Short C4 genes (no HERV)    3.94E-14  -0.085  0.011   0.32  -0.015  0.015   7.38E-12 0.225  0.033
Total C4 copy number        9.41E-15   0.123  0.016   1.00   0.000  0.025   5.40E-11 0.247  0.038

Specific C4 locus structures (haplotypes of one or more C4 genes)

BS                          2.29E-19  -0.171  0.019   0.16  -0.045  0.032   1.10E-06 0.200  0.041
AL-BS                       0.03      -0.027  0.013   0.62   0.007  0.013   3.39E-23 0.250  0.025
AL-BL                       6.16E-07   0.058  0.012   0.41   0.011  0.013   6.34E-19 0.238  0.027
AL-AL                       2.09E-06   0.093  0.020   0.56  -0.013  0.023   2.49E-19 0.256  0.028

Combinations of C4 locus structure and flanking SNP haplotype

AL-BS-4                     0.21      -0.027  0.022   0.95  -0.002  0.022   7.99E-24 0.247  0.025
AL-BS-2                     0.35      -0.031  0.033   0.80  -0.008  0.033   5.45E-24 0.247  0.024
AL-BS-3                     0.34      -0.028  0.029   0.91  -0.003  0.030   5.69E-24 0.247  0.024
AL-BS-5                     0.07       0.053  0.029   0.01   0.077  0.029   5.73E-25 0.252  0.024
AL-BS-1                     0.04      -0.064  0.031   0.19  -0.040  0.031   1.32E-23 0.245  0.024
AL-BS-other                 0.30      -0.023  0.022   0.92   0.002  0.023   6.19E-24 0.247  0.025
AL-BL-2                     0.03       0.036  0.017   0.77   0.005  0.017   3.52E-23 0.246  0.025
AL-BL-3                     2.15E-03   0.042  0.014   0.55   0.008  0.014   3.28E-22 0.243  0.025
AL-BL-1                     0.35       0.032  0.034   0.92   0.003  0.035   5.55E-24 0.247  0.024
AL-BL-other                 0.23       0.029  0.024   0.95   0.002  0.024   7.37E-24 0.247  0.025
AL-AL-1                     0.01       0.074  0.028   0.38  -0.026  0.029   9.42E-23 0.255  0.026
AL-AL-2                     3.33E-04   0.097  0.027   0.94  -0.002  0.029   2.31E-21 0.248  0.026

Alternative hypotheses 


C4B nulls (for whom total C4B 


copy number = 0) * 0.36 0.061 0.066 
Extended Data Figure 5 | Detailed analysis of the association of 
schizophrenia to genetic variation at and around C4, in data from 
28,799 schizophrenia cases and 35,986 controls. (Psychiatric Genomics 
Consortium, ref. 6.) SCZ, schizophrenia; β, estimated effect size per copy 
of the genomic feature or allele indicated; SE, standard error. Detailed 
association analyses of HLA alleles are in Extended Data Figs 6 and 7. 
The single asterisk (*) indicates that we specifically tested C4B-null status 
because a 1985 study reported an analysis of 165 schizophrenia patients 
and 330 controls in which rare C4B-null status associated with elevated 
risk of schizophrenia, though two subsequent studies found no 
association of schizophrenia to C4B-null genotype. We sought to evaluate 
this using the large data set in this study, finding no association to 
C4B-null status. The double asterisk (**) indicates total copy number 
of C4 is also strongly correlated to copy number of the CYP21A2P 
pseudogene, which is present on duplicated copies of the sequence shown 
in Extended Data Fig. 1g. 
Association p-value for each variant included in the analysis.
C4Aexp, genetically predicted C4A expression. A dash marks a variant not included in that analysis.

Variants included in analysis     rs13194504      C4Aexp      B*0801   DRB1*0301     DQB1*02

rs13194504                          5.50E-28           -           -           -           -
C4Aexp                                     -    3.60E-24           -           -           -
B*0801                                     -           -    1.20E-21           -           -
DRB1*0301                                  -           -           -    6.40E-19           -
DQB1*02                                    -           -           -           -    6.25E-17
rs13194504, C4Aexp                  8.00E-14    7.80E-10           -           -           -
rs13194504, B*0801                  5.75E-11           -    2.00E-04           -           -
rs13194504, DRB1*0301               2.89E-15           -           -    6.20E-06           -
rs13194504, DQB1*02                 9.60E-19           -           -           -    1.82E-07
rs13194504, C4Aexp, B*0801          6.94E-11    8.23E-07        0.96           -           -
rs13194504, C4Aexp, DRB1*0301       1.39E-12    2.18E-05           -        0.41           -
rs13194504, C4Aexp, DQB1*02         3.06E-13    1.03E-04           -           -        0.03

Extended Data Figure 7 | Detailed association analysis for the most 
strongly associating classical HLA alleles. The most strongly associating 
HLA loci were HLA-B (in primary analyses, Fig. 4a and Extended Data 
Fig. 6a) and HLA-DRB1 and HLA-DQB1 (in analyses controlling for the 
signal defined by rs13194504, Fig. 4c and Extended Data Fig. 6b). At these 
loci, the most strongly associating classical HLA alleles were HLA-B*0801, 
HLA-DRB1*0301, and HLA-DQB1*02, respectively. These HLA alleles 
are all in strong but partial LD with C4 BS, the most protective of the C4 
alleles; they are also in partial LD with the low-risk allele at rs13194504, 
representing the distinct signal several megabases to the left (Fig. 4). 
In joint analyses with each of these HLA alleles, genetically predicted 
C4A expression and rs13194504 continued to associate strongly with 
schizophrenia, while the HLA alleles did not. In further joint analyses 
with rs13194504 and genetically predicted C4A expression, 0 of 2,514 
tested HLA SNP, amino acid and classical-allele polymorphisms (from 
ref. 55, including all variants with minor allele frequency (MAF) >0.005) 
associated with schizophrenia as strongly as rs13194504 or predicted C4A 
expression did. 
Extended Data Figure 8 | Expression of C4A RNA in brain tissue (five brain regions) from 35 schizophrenia cases and 70 non-schizophrenia 
controls, from the Stanley Medical Research Institute Array Consortium. C4A RNA expression levels were measured by ddPCR. P values are derived 
from Mann-Whitney U-test. 
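
For readers unfamiliar with the test named in this legend, the following is a minimal sketch of a Mann-Whitney U statistic with a two-sided P value from the normal approximation. The approximation omits tie and continuity corrections, which is an assumption of this sketch and may differ from the procedure used for the figure; the function names are illustrative only.

```python
import math
from typing import Sequence


def mann_whitney_u(a: Sequence[float], b: Sequence[float]) -> float:
    """U statistic for sample a: number of (x, y) pairs with x > y, ties counting 0.5."""
    u = 0.0
    for x in a:
        for y in b:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u


def normal_approx_p(u: float, n1: int, n2: int) -> float:
    """Two-sided P value from the large-sample normal approximation (no tie correction)."""
    mean = n1 * n2 / 2.0
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mean) / sd
    # erfc(|z| / sqrt(2)) equals twice the upper-tail probability of a standard normal.
    return math.erfc(abs(z) / math.sqrt(2.0))
```

The two U statistics always sum to n1 × n2, so either sample can be passed first; the P value from `normal_approx_p` is the same for both because the approximation is symmetric about n1 × n2 / 2.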
Extended Data Figure 9 | Secretion of C4, and specificity of the 
monoclonal anti-C4 antibody for C4 protein in human brain tissue and 
cultured primary cortical neurons. a, Brain tissue (from an individual 
affected with schizophrenia) was stained with a fluorescent secondary 
antibody, C4 antibody, or C4 antibody that was pre-adsorbed with purified 
C4 protein. Confocal images demonstrate the loss of immunoreactivity 
in the secondary-only and pre-adsorbed conditions. b, Primary human 
neurons were stained with a fluorescent secondary antibody, C4 antibody 
or C4 antibody that was pre-adsorbed with purified C4 protein. Confocal 
images demonstrate the loss of immunoreactivity in the secondary-only 
and pre-adsorbed conditions. Scale bars, 25 μm. c, Secretion of C4 protein 
by cultured primary neurons. Western blot for C4 protein analysis. 
(+) Purified human C4 protein. (-) Unconditioned medium, a negative 
control. HN-conditioned shows the same medium after conditioning by 
cultured human neurons at days 7 (d7) and 30 (d30). Details of western 
blot protocol, antibody catalogue numbers and concentrations used are in 
Supplementary Methods. C4 molecular weight, ~210 kDa. 
Extended Data Figure 10 | Mouse C4 genes and additional analyses of 
the dLGN eye segregation phenotype in C4 mutant mice and wild-type 
and heterozygous littermate controls. a, The functional specialization 
of C4 into C4A and C4B in humans does not have an analogy in mice. 
Although the mouse genome contains both a C4 gene and a C4-like 
gene (classically called Slp), and these genes are also present as a tandem 
duplication within the mouse MHC locus, analysis of the encoded 
protein sequences indicates a distinct specialization, as illustrated by the 
protein phylogenetic tree. Top, mouse Slp is indicated in grey to reflect 
its potential pseudogenization: Slp is already known to have mutations 
at a C1s cleavage site, which are thought to abrogate activation of the 
protein through the classical complement pathway; and the M. musculus 
reference genome sequence (mm10) at Slp shows a 1-bp deletion (relative 
to C4) within the coding region at chr17:34815158, which would be 
predicted to cause a premature termination of the encoded protein. 
In some genome data resources, mouse Slp and C4 have been annotated 
respectively as ‘C4a’ (for example, NM_011413.2) and ‘C4b’ (for example, 
NM_009780.2) based on synteny with the human C4A and C4B genes, 
but the above sequence analysis indicates that they are not paralogous 
to C4A and C4B. b, Sequence differences between C4A and C4B (which 
are otherwise 99.5% identical at an amino acid level) are concentrated 
at the ‘isotypic site’ where they shape each isotype’s relative affinity for 
different molecular targets. At the isotypic site, mouse C4 contains a 
combination of the residues present in human C4A and C4B. c, Expression 
of mouse C4 mRNA in whole retina and lateral geniculate nucleus (LGN) 
from P5 animals and in purified retinal ganglion cells (RGCs) from P5 and 
P15 animals. These time points were chosen as P5 is a time of more robust 
synaptic refinement in the retinogeniculate system compared to P15. The 
same assays detected no C4 RNA in control RNA isolated from C4-/- mice 
(not shown). n = 3 samples for P5 retina, LGN and P15 RGCs; n = 4 
samples for P5 RGCs; *P < 0.05 by ANOVA with post-hoc Tukey-Kramer 
multiple-comparisons test. d, Representative images of dLGN innervation 
by contralateral projections (red in bottom image), ipsilateral projections 
(green in bottom image), and their overlap (yellow in bottom image). 
Scale bar, 100 μm. e, Quantification of the percentage of total dLGN area 
receiving both contralateral and ipsilateral projections shows a significant 
increase in C4-/- compared to wild-type littermates (ANOVA, n = 5 mice 
per group, P < 0.01). These data are consistent with results using R value 
analysis as shown in Fig. 7. f, Quantification of total dLGN area showed 
no significant difference between wild-type and C4-/- mice (ANOVA, 
n = 5 per group, P > 0.05). g, Quantification of dLGN area receiving 
ipsilateral innervation showed a significant increase in ipsilateral territory 
in the C4-/- mice compared to wild-type littermates (ANOVA, n = 5 mice 
per group, P > 0.01). This result is consistent with defects in eye-specific 
segregation. Scale bar, 100 μm. h, The number of RGCs in the retina was 
estimated by counting the number of Brn3a+ cells in wild-type and C4-/- 
mice. No differences were observed between wild-type and C4-/- mice 
(t test, n = 4 mice per group, P > 0.05). Scale bar, 100 μm. 
Naturally occurring p16Ink4a-positive 
cells shorten healthy lifespan 


Darren J. Baker, Bennett G. Childs, Matej Durik, Melinde E. Wijers, Cynthia J. Sieben, Jian Zhong, Rachel A. Saltness, 
Karthik B. Jeganathan, Grace Casaclang Verzosa, Abdulmohammad Pezeshki, Khashayarsha Khazaie, 
Jordan D. Miller & Jan M. van Deursen 


Cellular senescence, a stress-induced irreversible growth arrest often characterized by expression of p16Ink4a (encoded 
by the Ink4a/Arf locus, also known as Cdkn2a) and a distinctive secretory phenotype, prevents the proliferation of 
preneoplastic cells and has beneficial roles in tissue remodelling during embryogenesis and wound healing. Senescent 
cells accumulate in various tissues and organs over time, and have been speculated to have a role in ageing. To explore the 
physiological relevance and consequences of naturally occurring senescent cells, here we use a previously established 
transgene, INK-ATTAC, to induce apoptosis in p16Ink4a-expressing cells of wild-type mice by injection of AP20187 
twice a week starting at one year of age. We show that compared to vehicle alone, AP20187 treatment extended median 
lifespan in both male and female mice of two distinct genetic backgrounds. The clearance of p16Ink4a-positive cells 
delayed tumorigenesis and attenuated age-related deterioration of several organs without apparent side effects, including 
kidney, heart and fat, where clearance preserved the functionality of glomeruli, cardio-protective KATP channels and 
adipocytes, respectively. Thus, p16Ink4a-positive cells that accumulate during adulthood negatively influence lifespan 
and promote age-dependent changes in several organs, and their therapeutic removal may be an attractive approach to 
extend healthy lifespan. 


Cellular senescence is a well-established cancer defence mechanism 
that has also been proposed to have roles in ageing and age-associated 
diseases, presumably through the depletion of stem and progenitor 
cells, and the adverse actions of the senescence-associated secretory 
phenotype, which consists of many proinflammatory cytokines and 
chemokines, matrix metalloproteinases and growth factors (refs 1-3). 
Consistent with this idea is the observation that interference with 
senescent cell accumulation in BubR1 progeroid mice delays several of 
the ageing-associated disorders that these animals develop. However, 
because progeroid syndromes do not mimic the complex degenerative 
changes of ageing completely, the relevance of these findings has 
remained unclear. Furthermore, recent studies showing that senescent 
cells have beneficial effects in injury repair and tissue remodelling 
have called into question the simplistic view of senescence as only a 
driver of age-dependent pathologies, raising the specter that senescent 
cell clearance might remove useful cells in addition to detrimental 
ones. Here we investigated the identity and physiological effect of 
naturally occurring senescent cells using INK-ATTAC (hereafter termed 
ATTAC), a transgenic mouse model that expresses the FK506-binding-
protein-caspase 8 (FKBP-Casp8) fusion protein and green fluorescent 
protein (GFP) under the control of a minimal Ink4a (also known as 
Ink4a/Arf or Cdkn2a) promoter fragment transcriptionally active in 
senescent cells. We have previously shown that, in BubR1 progeroid 
mice, ATTAC ablates p16Ink4a-positive senescent cells upon 
administration of AP20187 (AP), a dimerizer that activates FKBP-fused Casp8 
(ref. 4). Our first objective was to validate the properties of ATTAC in 
naturally occurring p16Ink4a-positive senescent cells. 


ATTAC clears senescent adipocyte progenitor cells 
Our initial validation was focused on fat. We collected GFP+ and 
GFP- cell populations from inguinal white adipose tissue (iWAT) of 
12-month-old ATTAC mice by FACS (Fig. 1a). GFP+ cells expressed 
much higher levels of Ink4a and FKBP-Casp8 than GFP- cells, as well 
as a broad panel of senescence markers (Fig. 1b). GFP+ cells, but not 
GFP- cells, were also highly positive for senescence-associated 
β-galactosidase (SA-β-Gal; Fig. 1c). Furthermore, intact iWAT from 
aged, but not young, ATTAC mice had SA-β-Gal activity, but less than 
iWAT of BubR1 progeroid mice, a difference also reflected in Ink4a, 
FKBP-Casp8 and GFP transcript levels (Extended Data Fig. 1a, b). 

iWAT of 18-month-old ATTAC mice treated bi-weekly with AP from 
12 months onwards had eightfold fewer GFP+ adipocyte progenitors 
than vehicle-injected controls, although total progenitor cell numbers 
remained unchanged (Fig. 1d and Extended Data Fig. 1c). SA-β-Gal 
staining and quantitative reverse transcriptase PCR (qRT-PCR) analysis 
of senescence markers confirmed that Ink4a-positive senescent cells 
in iWAT increased between 12 and 18 months, and that AP eliminated 
most of these cells (Fig. 1e and Extended Data Fig. 1d). Consistent 
with senescence of progenitor cells, transmission electron microscopy 
(TEM) on SA-β-Gal-stained iWAT showed that X-Gal crystals were 
present in small perivascular cells rather than endothelium, white blood 
cells or adipocytes (Fig. 1f). X-Gal crystals were found in 0.2% and 
1.6% of total iWAT cells from AP-treated and control mice, respectively 
(Fig. 1g). 

Clearance of Ink4a-positive cells prevented loss of fat mass occurring 
between 12 and 18 months (Fig. 1h, i and Extended Data Fig. 1e). 
Age-dependent fat tissue dysfunction is characterized by decreased 
adipogenesis and adipocyte atrophy. Consistent with this, adipocyte 
size decreased between 12 and 18 months of age (Fig. 1j), as did 
transcript levels of two key transcriptional regulators of adipogenesis, Pparg 
and Cebpa (Fig. 1k). AP treatment of ATTAC mice prevented these 
decreases. Collectively, these data indicate that senescence contributes 
to age-dependent fat tissue alterations. 


1Department of Pediatric and Adolescent Medicine, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA. 2Department of Biochemistry and Molecular Biology, Mayo Clinic College 
of Medicine, Rochester, Minnesota 55905, USA. 3Division of Cardiovascular Surgery, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA. 4Department of Immunology, Mayo Clinic 

Figure 1 | Clearance of senescent fat progenitor cells attenuates 
age-related lipodystrophy. a, FACS profiles of single-cell suspensions 
from iWAT of the indicated 12- and 24-month-old (m) mice. b, GFP+ 
and GFP- cell populations from iWAT of 12-month-old ATTAC mice (see 
a for sorting brackets) analysed by qRT-PCR (n = 6 mice). p21 is also 
known as Cdkn1a; Pai1 is also known as Serpine1. c, SA-β-Gal activity in 
GFP+ and GFP- iWAT cells (n = 3 mice). d, GFP+ cells in the indicated 
iWAT (IAT) cell populations (‘rest’ represents the iWAT vascular stromal 
fraction minus leukocytes, endothelial cells and progenitors). e-i, Fat-
related analyses of C57BL/6 ATTAC mice before treatment (12 months) 
or after 6 months of treatment with vehicle (18 months -AP) or AP 
(18 months +AP). e, SA-β-Gal activity in iWAT and epididymal WAT 
(eWAT). f, Electron micrograph showing perivascular X-Gal-positive 
cells from an 18-month-old vehicle-treated C57BL/6 ATTAC male. 
A, adipocyte; C, capillary. Arrows mark endothelial cells. g, Quantification 
of iWAT cells containing X-Gal crystals (n = 4 mice per treatment). h, Fat 
mass measurements. i, iWAT and eWAT depot weights. j, Mean adipocyte 
diameters in iWAT. k, Expression of adipogenesis markers in iWAT (n = 4 
mice per group). Scale bars, 10 μm (c), 0.5 cm (e), 2 μm (f) and 200 nm 
(f, inset). Error bars indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 
(one-sample t-tests using a theoretical mean of 1 (b), and unpaired 
two-tailed t-tests (c, d, g-k)). 


Adipose tissue of young ATTAC mice lacked SA-β-Gal activity but 
contained p16Ink4a (Extended Data Fig. 1a, f-h). This p16Ink4a pool did 
not decline after AP treatment. Similar results were obtained with early 
passage ATTAC MEFs (Extended Data Fig. 1i-k), indicating that baseline 
Ink4a promoter activity in non-senescent cells is insufficient for 
FKBP-Casp8-mediated apoptosis. Furthermore, ATTAC lacks Ink4a 
promoter elements required for transgene induction in replicating 
pre-neoplastic cells that robustly express endogenous Ink4a owing to Rb 
inactivation and AP fails to kill these cells (Extended Data Fig. 2a-g). 
ATTAC was also not induced in peripheral blood T lymphocytes that 
robustly engage endogenous Ink4a with ageing without concomitant 
expression of several senescence markers (Extended Data Fig. 2h), 
further indicating that transgene induction is selective for senescence. 
However, these limited analyses certainly do not exclude the possibility 
that other Ink4a-positive non-senescent cells engage ATTAC and die 
after AP exposure (Supplementary Information). 

Figure 2 | Senescent cell clearance extends lifespan. a, Study design 
for clearance of senescent cells in mixed and C57BL/6 mouse cohorts. 
Healthspan analysis was done at 18 months, an age at which relatively few 
mice in vehicle (Veh.)- or AP-treated groups have died, and bias owing 
to selection for long-lived animals is unlikely. b, c, Survival curves for 
vehicle-treated (-AP) and AP-treated (+AP) mixed (b) and C57BL/6 
(C57; c) mice. Median survival (in days, d) and percentage increase in 
median survival are indicated. We note that median lifespans of our 
vehicle-treated cohorts are similar to those of wild-type mice administered 
AP (c). *P < 0.05; **P < 0.01; ***P < 0.001 (log-rank tests). 


Clearance by ATTAC is partial and tissue selective 

To extend our analysis of the properties of ATTAC, we analysed the 
transcript levels of Ink4a, FKBP-Casp8, GFP and a panel of senescence 
markers in a broad spectrum of tissues of 2-, 12- and 18-month-old 
untreated ATTAC mice, including skeletal muscle, eye, kidney, lung, 
heart, liver, colon and spleen. ATTAC mice treated with AP between 
12 and 18 months were included to assess senescent cell clearance rates. 
Ink4a expression markedly increased between 12 and 18 months in all 
tissues examined, which coincided with the induction of FKBP-Casp8, 
GFP and several senescence markers (Extended Data Fig. 3a). Increased 
expression of these transcripts was blunted to varying degrees by AP 
treatment in all tissues examined except colon and liver, indicating that 
the ATTAC system eliminates Ink4a-positive senescent cells in a partial 
and tissue/organ selective fashion. Blunting of age-dependent increases 
in Il6, Il1a and Tnfa expression in fat, skeletal muscle and kidney by 
AP treatment (Extended Data Fig. 3b) suggests that Ink4a-positive cell 
clearance ameliorates inflammation in these tissues. 


p16Ink4a+ cells shorten lifespan and healthspan 

To examine the effect of p16Ink4a-positive cell clearance on health and 
lifespan, we sequentially established two cohorts of ATTAC transgenic 
mice (Fig. 2a). The initial cohort was on a C57BL/6-129Sv-FVB 
mixed genetic background fed a diet containing 9% fat. We note that 
this diet shortens lifespan compared to diets with 5% fat typically 
used in lifespan studies (Extended Data Fig. 4a, b and Supplementary 
Information). The later cohort was on a congenic C57BL/6 background 
fed a standard 5% fat diet. At 12 months of age, when p16Ink4a+ cells 
started to accumulate in several tissues (Extended Data Figs 1d and 3a), 
mice were injected twice a week with AP or vehicle until they became 
moribund or died of natural causes. Mice separate from the longevity 
cohorts were examined for a series of age-sensitive outcomes at 
18 months, an age at which relatively few mice in each of the cohorts 
had died. Data for both sexes combined showed that median lifespans 
of mixed and C57BL/6 AP-treated animals were increased by 27% and 
24%, respectively. Median lifespans for each sex separately were also 
significantly extended in AP-treated cohorts irrespective of genetic 
background, with increases ranging from 17% to 35% (Fig. 2b, c). 

Figure 3 | Clearance of senescent cells prolongs healthspan. a, Survival 
curves of ATTAC mice dying of cancer (mice that had an overt tumour 
at time of death; mice with lymphomas, sarcomas and carcinomas were 
included, mice without tumours were censored). Median survival and 
percentage increase are indicated. b, Representative images of aged mice 
with and without senescent cell clearance. c, Spontaneous movement and 
exploratory behaviour of ATTAC mice analysed by the open field test. 
Error bars indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (log-rank 
test (a) and unpaired two-tailed t-test (c)). 
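
The median-survival comparisons reported in this section can be illustrated with a minimal sketch of a Kaplan-Meier estimate that handles censored animals, followed by the percentage increase in median survival. The convention used here (median is the first time at which estimated survival falls to 0.5 or below), the function names and the example numbers in the usage note are assumptions for illustration, not the authors' analysis pipeline or data.

```python
from fractions import Fraction
from typing import List, Optional, Sequence, Tuple


def kaplan_meier(times: Sequence[int], events: Sequence[bool]) -> List[Tuple[int, Fraction]]:
    """Return (time, survival) pairs at each distinct death time.

    events[i] is True if animal i died at times[i] and False if it was censored.
    Exact fractions avoid floating-point trouble when comparing against 1/2.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = Fraction(1)
    curve: List[Tuple[int, Fraction]] = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all animals leaving the risk set at the same time point.
        while i < len(data) and data[i][0] == t:
            deaths += 1 if data[i][1] else 0
            removed += 1
            i += 1
        if deaths:
            survival *= Fraction(at_risk - deaths, at_risk)
            curve.append((t, survival))
        at_risk -= removed
    return curve


def km_median(curve: Sequence[Tuple[int, Fraction]]) -> Optional[int]:
    """First time at which estimated survival is <= 1/2, or None if never reached."""
    for t, s in curve:
        if s <= Fraction(1, 2):
            return t
    return None


def percent_increase(control_median: float, treated_median: float) -> float:
    """Percentage increase of the treated median over the control median."""
    return 100.0 * (treated_median - control_median) / control_median
```

As a usage example with made-up numbers, control and treated medians of 700 and 875 days would give `percent_increase(700, 875)` of 25.0, that is, a 25% extension of median lifespan.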

Maximum lifespan was significantly increased for mixed AP-treated 
males and females combined (P = 0.0295), but not for females and 
males individually. Maximum lifespan was not extended for C57BL/6 
AP-treated animals, either combined or separately. Importantly, AP 
treatment of mice lacking the ATTAC transgene did not improve 
lifespan (Fig. 2c). We note that the median lifespan of vehicle-
treated C57BL/6 males, but not females, was short of the normal range 
of lifespans for unmanipulated males of this strain at different 
laboratories (Extended Data Fig. 4c, d), suggesting that repetitive 
vehicle injection stress may have negatively affected C57BL/6 male 
longevity (Supplementary Information). 

In both cohorts, AP treatment had no effect on the incidence or 
spectrum of macroscopically detectable tumours at autopsy, although 
tumour latency was increased (Fig. 3a and Extended Data Fig. 5a-d). 
Median lifespan extensions of AP-treated mice dying without tumours 
ranged from 24% to 42% (Extended Data Fig. 5e), indicating that 
increased longevity was not merely due to a tumour-protective effect. 
AP-treated mice were overtly indistinguishable from vehicle-injected 
littermates at 18 months of age, but typically had a healthier appearance 
by 22 months (Fig. 3b). AP treatment delayed cataract formation in 
both males and females on a C57BL/6 background (Extended Data 
Fig. 5f, g). Despite a lack of overt difference at 18 months, AP treatment 
prevented age-dependent reductions in both spontaneous activity and 
exploratory behaviour measured by open-field testing (Fig. 3c), which 
was independent of sex and genetic background. 

Extended tests on these mice showed no differences in motor 
coordination and balance, memory, exercise ability or muscle strength 
(Extended Data Fig. 6), indicating that 6 months of p16Ink4a-positive 
cell clearance did not affect these age-sensitive traits. Similarly, 
age-related changes in the circulating haematopoietic profile, including 
regulatory T-cell levels, were also not affected (Extended Data Fig. 7). 
At 18 months, fasting glucose levels, glucose tolerance, insulin 
sensitivity and circulating Igf1 levels were not significantly different in AP- and 
vehicle-treated C57BL/6 ATTAC females, as was signalling downstream 
of Igf1 and insulin receptors in fat, kidney and muscle, three tissues 
where we observed clearance of p16Ink4a+ cells (Extended Data Fig. 8). 

Figure 4 | Senescent cells cause glomerulosclerosis, kidney dysfunction 
and renal RAAS hyperactivity. a, Images of sclerotic (left) and normal 
(right) glomeruli from the indicated mice. H&E, haematoxylin and eosin; 
PAS, periodic acid-Schiff. b, Quantification of sclerotic glomeruli. 
c, Measurements of blood urea nitrogen levels. d, SA-β-Gal-stained kidney 
sections. e, Electron micrograph showing an X-Gal crystal-containing renal 
epithelial cell with brush border membrane (arrowheads). Insets show 
X-Gal crystal close-ups. f, Percentage of cells with X-Gal crystals in renal 
sections (n = 5 TEM grids for each treatment group). g, Renal expression 
of Agtr1a analysed by qRT-PCR (n = 4 mice per group). h, Western blot 
of kidney lysates probed for Agtr1a (n = 3 mice per treatment group). 
Ponceau S staining served as loading control. i, Immunostaining of kidney 
sections for Agtr1a. Yellow circles denote glomeruli. Scale bars, 50 μm 
(a and i, bottom), 250 μm (d), 5 μm (e), 200 nm (e, insets) and 100 μm 
(i, top). Error bars indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 
(unpaired two-tailed t-tests). For gel source data, see Supplementary Fig. 1. 
All mice in d-i were C57BL/6 ATTAC. 


Renal epithelial cell senescence and glomerulosclerosis 

To further investigate the effect of p16Ink4a+ cells on physiological 
functions that change with age, we focused on kidney and heart, two vital 
organs in which we observed ATTAC-mediated clearance. Aged kidneys 
are characterized by the formation of sclerotic glomeruli, which 
affect glomerular filtration rates, impair kidney function, and lead to 
increased blood urea nitrogen levels. Indeed, glomerulosclerosis 
significantly increased between 12 and 18 months in vehicle-treated 
ATTAC mice. AP treatment markedly reduced glomerulosclerosis 
independent of sex and genetic background (Fig. 4a, b), which 
correlated with attenuated age-related increases in blood urea nitrogen 
(Fig. 4c), indicating preserved kidney function. SA-β-Gal staining of 
kidney sections confirmed that AP-mediated disposal of senescent cells 
was quite efficient (Fig. 4d). By TEM, we observed X-Gal crystals in 
0.3% and 1.2% of renal cells of 18-month-old treated and untreated 
mice, respectively (Fig. 4e, f). Surprisingly, SA-β-Gal+ cells were not 
located in the glomeruli but in proximal tubules (Fig. 4e), raising the 
question as to how senescent tubular brush-border epithelial cells might 
promote glomerulosclerosis. 

Angiotensin receptor blockers attenuate age-related glomerulosclerosis 
under normotensive conditions, which has led to the idea that 
overactivation of the local renin-angiotensin-aldosterone system (RAAS) drives 
the formation of sclerotic glomeruli. Consistent with this, kidney 
transcript levels of a key component of this system, angiotensin receptor 
1a (Agtr1a), increased between 12 and 18 months (Fig. 4g). By contrast, 
no such increase was observed in AP-treated mice. Western blotting 
confirmed that Agtr1a protein levels were lower in AP-treated kidney 
samples (Fig. 4h). Both renal tubules and glomeruli contributed to this 
decline, as demonstrated by immunolabelling of kidney sections for 
Agtr1a (Fig. 4i). These data suggest that senescent renal epithelial cells 
may produce senescence-associated secretory phenotype components 
that hyperactivate the local RAAS in kidney. 


p16Ink4a+ cells contribute to cardiac ageing 

Hearts of 12-month-old mice showed SA-β-Gal activity at the atrial and 
ventricular surface (Fig. 5a), which increased with ageing in vehicle-
treated but not AP-treated mice. TEM revealed that ciliated epithelial 
cells and fibroblasts in the pericardium contained X-Gal crystals 
(Fig. 5b, c). We also observed SA-β-Gal+ smooth muscle cells in the 
aortic root wall of vehicle-treated animals (Fig. 5a and Extended Data 
Fig. 9a). At 18 months, vehicle- and AP-treated ATTAC mice analysed 
by echocardiography showed no significant differences in heart 
rate, left ventricular mass, thickness and diameter, ejection fraction 
and fractional shortening, all of which were also unchanged from 
12-month-old mice (Extended Data Fig. 9b-g). Cardiac ageing at the 
histological level is characterized by a loss of ventricular cardiomyocytes 
due to decreasing ability to replace apoptotic or necrotic 
cardiomyocytes, inducing hypertrophy of the remaining cardiomyocytes. 
Morphometric analysis of heart sections showed that ventricular wall 
thickness was the same in vehicle- and AP-treated animals, regardless 
of sex (Fig. 5d). Ventricular cardiomyocytes of AP-treated mice, however, 
were much smaller (Fig. 5e, f), suggesting that they had more 
cardiomyocytes than vehicle-treated mice. Taken together, these data 
suggest that p16Ink4a-positive cells are key drivers of this age-related 
cardiac phenotype. 

Cardiac ageing is also characterized by decreasing stress tolerance, 
which has been attributed to decreasing numbers of ATP-sensitive 
potassium (KATP) channels in the sarcolemma of cardiac myocytes 
due to age-related reductions in the expression of Sur2a, a key 
regulatory subunit of KATP channels. Cardiac Sur2a (also known as 
Abcc9) expression indeed decreased between 12 and 18 months in 
vehicle-treated mice (Fig. 5g), but no such decline was observed in 
AP-treated mice, suggesting that cardiac stress tolerance was preserved. 
To test this, we measured the time to death from arrhythmogenesis after 
injection of 580 mg kg-1 of the β-adrenergic agonist isoproterenol. 
Consistent with age-related loss of cardiac stress resistance, 18-month-
old vehicle-treated mice died faster than 12-month-old mice (Fig. 5h). 
By contrast, the time to death was not accelerated in AP-treated mice. 
In a second, more physiological stress test, we conducted 
echocardiographic measurements of ventricular mass before and after eliciting 
cardiac stress over a 6-day period, by administering 10 mg kg-1 
isoproterenol twice a day. Cardiac mass significantly increased in 18-month-
old vehicle-treated mice, whereas AP-treated mice were capable of 
handling the applied stress without such adaptive response, similar to 
young mice (Fig. 5i). 

Figure 5 | Senescent cells promote age-related cardiomyocyte 
hypertrophy and loss of cardiac stress tolerance. a, SA-β-Gal-stained 
hearts. Insets show aortic roots (ar) from a transverse plane (arrow marks 
the aortic root wall) or close-ups of the ventricular (v) and arterial (a) 
boxed areas. b, Electron micrographs of X-Gal-positive cells in the 
pericardium (red asterisk marks cilia). Insets show close-ups of X-Gal 
crystals. c, Quantification of cells with X-Gal crystals in the visceral 
pericardium (n = 4 mice per treatment). d, Measurements of left ventricle 
wall thickness (n = 4 mice per group). e, Representative cardiomyocyte 
cross-sectional images (n = 4 mice per group). f, Quantification of e. 
g, Analysis of Sur2a expression in hearts by qRT-PCR (n = 4 mice per 
group). h, Cardiac stress resistance determined by measuring the time 
to death after injection of a lethal dose of isoproterenol. i, Change in left 
ventricular (LV) mass in response to sublethal doses of isoproterenol 
(10 mg kg-1) after ten doses administered over 5 days. Scale bars, 1 mm (a), 
2 μm (b) and 200 nm (b, inset). Error bars indicate s.e.m. *P < 0.05; 
**P < 0.01; ***P < 0.001 (unpaired two-tailed t-tests). All mice, except 
for h, were C57BL/6 ATTAC. 

We also examined whether AP treatment caused any off-target or
detrimental effects. Wild-type mice lacking ATTAC did not show 
healthspan improvements when treated with AP (Extended Data 
Fig. 9h-l). Consistent with a beneficial role of senescent cells only for 
the duration of wound repair®®, 18-month-old AP-treated ATTAC 
mice repaired cutaneous wounds when drug administration was 
suspended during healing (Extended Data Fig. 10a—c). When AP was 
administered during wound closure, healing was delayed with similar 
kinetics as 4- or 18-month-old ATTAC mice without previous AP treat- 
ment, indicating that acute senescence mechanisms are preserved with 
ageing and not influenced by constitutive clearance of senescent cells’. 
Furthermore, 18-month-old AP-treated mice showed no evidence of 
increased fibrosis in skin and other tissues, despite a role for senescent 
cells in limiting fibrosis during tissue repair (Extended Data Fig. 10d). 


Discussion 

Here we eliminated p16Ink4a+ cells from non-progeroid mice using
ATTAC to begin to address how senescent cells influence health
and lifespan. We observe median lifespan extensions in AP-treated
ATTAC mice on two distinct genetic backgrounds and diets, indicating
that age-related accumulation of p16Ink4a+ cells negatively affects
longevity. It will be useful to optimize senescent cell removal proto- 
cols and methods further because the longevity of male C57BL/6 mice 
seemed negatively affected by repetitive vehicle injection stress, and 
because clearance was partial and several key tissues were refractory 
to clearance, including liver and colon. This would be particularly rel- 
evant for further studies of maximum lifespan, extension of which is 
thought to be reflective of a more generalizable ‘anti-ageing’ effect”® 
(Supplementary Information). 

These methodological caveats aside, we find that removal of 
p16Ink4a+ cells from midlife on delays the progression of neoplastic
disease irrespective of genetic background and diet. Clearance had no 
effect on the spectrum or incidence of post-mortem tumours, which is 
consistent with the hypothesis that senescent cells help create a tissue 
microenvironment that promotes tumour progression!. In addition, 
several age-dependent changes”, including alterations in fat, eye, heart 
and kidney, as well as the age-dependent decline in spontaneous activ- 
ity and exploratory behaviour, occur more slowly during a six-month 
period of p16Ink4a+ cell clearance starting at midlife. Complementary
mechanistic studies suggest that local p16Ink4a+ cells in fat, kidney and
heart exert their effects through distinct mechanisms, involving pro- 
genitor cell dysfunction, RAAS overactivation, and Sur2a downregu- 
lation, respectively, although systemic effects cannot be excluded given 
the global nature of the clearance method applied here. 

Thus, our data best fit a model in which p16Ink4a+ cells act to shorten
healthy lifespan by promoting tumour progression and age-dependent 
changes that functionally impair certain tissues and organs, includ- 
ing vital organs such as kidney and heart. This, together with the key 
observation that elimination of p16Ink4a+ cells is not associated with any
overt detrimental effects, raises the possibility that this approach may
be useful to treat aspects of age-related functional decline, age-related
diseases that involve senescent cells, or side effects of therapies that
create senescent cells*®.


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Mouse strains and drug treatments. All studies described using ATTAC mice were 
performed on transgenic line 3, which contains 13 copies of the transgene inserted 
into a single locus (data available on request). The generation and characterization 
of the ATTAC transgenic line has been previously described in detail’. Two cohorts 
of ATTAC mice were consecutively established and analysed. The first cohort 
consisted of mice on a mixed 129Sv (40% ± 5%) × C57BL/6J (35% ± 5%) × FVB
(20% ± 5%) genetic background as determined by Harlan Laboratories using
established single nucleotide polymorphism (SNP) panels (Supplementary Table 1).
These mice were fed an irradiated pelleted breeder chow diet ad libitum for the 
entire duration of the study: LabDiet product 5058, with 9% fat content (21.635% 
calories from fat). Mice from this cohort were randomly assigned to be injected 
with AP20187 (AP; B/B homodimerizer, Clontech) or vehicle twice a week
beginning at 12 months of age. Dosing was 0.2 µg AP20187 per g body weight between
12 and 18 months (the same dose as previously used for senescent cell clearance in
BubR1 progeroid mice; see ref. 4), and 2.0 µg AP per g body weight thereafter.
The second cohort was established ~1.5 years later after completion of ATTAC 
line 3 backcrossing onto a pure C57BL/6J genetic background (Jackson Lab stock
number 000664). Detailed breeding information for the C57BL/6 ATTAC cohort is 
provided in Supplementary Table 2. For the entire duration of the study, congenic 
ATTAC transgenic mice were maintained ad libitum on standard irradiated pelleted 
chow diet: LabDiet product 5053 with 5% fat content (13.205% calories from fat). 
The number of mice in the C57BL/6J cohort was increased compared to the mixed 
cohort to allow for more comprehensive healthspan analyses at 18 months of age. 
The C57BL/6J cohort was randomly assigned to receive either 2.0 µg AP per g body
weight or vehicle from 12 months until the end of life. Non-transgenic mice from 
C57BL/6J ATTAC transgenic crosses were used to establish the control cohort for 
confirmation that AP had no effect on healthspan in the absence of the ATTAC 
transgene. BubR1H/H;ATTAC mice used in this study were generated as previously
described‘. Animals were housed in a pathogen-free barrier environment through- 
out the duration of the study and kept on a 12-h light-dark cycle. Specifically, mice 
were housed 5 per cage in static autoclaved HEPA-ventilated microisolator cages 
measuring 446 cm² (27 × 16.5 × 15.5 cm), which are opened only within class II
biosafety cabinets. Restricted entry to the room requires key card access to unlock 
the door. Entrance into the barrier facility requires wearing autoclaved gowns, 
disposable gloves, caps and shoe covers. Personnel are not allowed to enter any 
other rodent facility before entry into the barrier. Mouse colonies in this facil- 
ity are monitored through contact sentinels (one cage per rodent rack), which 
are evaluated quarterly for (and are free of) the following agents: Sendai virus, 
pneumonia virus of mice (PVM), mouse hepatitis virus (MHV), minute virus of 
mice (MVM), Theiler’s murine encephalomyelitis virus (TMEV strain GDVII), 
reovirus, rotavirus (EDIM), mouse parvovirus (MPV strain 1 and 2), murine nor- 
ovirus (MNV), Mycoplasma pulmonis, Aspiculuris tetraptera and Syphacia spp. 
In addition, the following are annually screened for (and free of) these agents: 
lymphocytic choriomeningitis virus (LCMV), ectromelia virus (mousepox), 
K virus, polyoma virus, mouse adenovirus (MAV type 1 and 2), hantavirus, 
Prospect Hill virus (PHV), mouse cytomegalovirus (MCMV), lactate
dehydrogenase elevating virus (LDEV), Encephalitozoon cuniculi, cilia-associated
respiratory bacillus (CARB), mouse thymic virus (MTV, MTLV), Clostridium piliforme,
Myobia spp. and Myocoptes spp. Autoclaved Enrich-o'Cobs (The Andersons 
Incorporated) were used as bedding. Cages and bedding were changed weekly. 
Mice had ad libitum access to municipal city water that was sterilized with ultra- 
violet light. Room temperature was maintained between 21°C and 23°C. Ear tags 
(1005-1, National Band and Tag Co.) were applied to pups at the time of genotyping 
for identification purposes. All animal procedures were approved by the Mayo 
Clinic Institutional Animal Care and Use Committee. 
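
The dosing schedule described above can be illustrated with a short calculation. This is a minimal sketch, not the authors' procedure: the function name ap_dose_ug, the cohort labels "mixed" and "c57bl6", and the treatment of the exact 18-month boundary are assumptions made for illustration only.

```python
def ap_dose_ug(body_weight_g: float, age_months: float, cohort: str) -> float:
    """Return the AP amount (in micrograms) for a single injection.

    Hypothetical helper. Assumed rules, taken from the text:
      - treatment starts at 12 months of age in both cohorts;
      - mixed cohort: 0.2 ug per g between 12 and 18 months, 2.0 ug per g afterwards;
      - C57BL/6 cohort: 2.0 ug per g from 12 months onwards.
    The text does not say which dose applies at exactly 18 months;
    this sketch assumes the higher dose starts at 18 months.
    """
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    if cohort not in ("mixed", "c57bl6"):
        raise ValueError("cohort must be 'mixed' or 'c57bl6'")
    if age_months < 12:
        return 0.0  # no treatment before 12 months
    if cohort == "mixed" and age_months < 18:
        dose_per_g = 0.2
    else:
        dose_per_g = 2.0
    return dose_per_g * body_weight_g


if __name__ == "__main__":
    # A 30 g mouse of the mixed cohort at 14 and at 20 months of age.
    print(ap_dose_ug(30.0, 14, "mixed"))
    print(ap_dose_ug(30.0, 20, "mixed"))
```

Under these assumptions, the tenfold dose increase in the mixed cohort after 18 months brings it in line with the dose used throughout in the C57BL/6 cohort.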

Statistical analysis. Prism software was used for statistical analysis and generation 
of survival, cataract and tumour latency curves. Two-tailed unpaired t-tests were 
used for pairwise significance unless otherwise indicated. Log-rank tests were used
for survival, cataract and tumour latency curves. For maximum lifespan extension 
analysis, two-sided Wang-Allison tests were used!*. For consistency in these com- 
parisons, the following denotes significance in all figures: *P < 0.05, **P< 0.01 and 
***P < 0.001. We note that no power calculations were used. Sample sizes were 
based on previously published experiments where differences were observed. No 
samples were excluded. Investigators were blinded to allocation during experi- 
ments and outcome assessment, except for rare instances where blinding was not 
possible. 
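
The significance notation used in the figures can be expressed as a simple mapping from P value to asterisks. The sketch below is illustrative only; the function name significance_stars is hypothetical, and the statistical tests themselves were run in Prism, not with this code.

```python
def significance_stars(p_value: float) -> str:
    """Map a P value to the asterisk notation used in the figures.

    Thresholds follow the text: *P < 0.05, **P < 0.01, ***P < 0.001.
    Returns an empty string when the result is not significant.
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("P value must lie between 0 and 1")
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    return ""


if __name__ == "__main__":
    for p in (0.2, 0.03, 0.004, 0.0002):
        print(p, repr(significance_stars(p)))
```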

Single-cell preparations and flow cytometry. To determine the cell type origin of 
senescent cells accumulating in fat, we prepared single cells from iWAT, collected 
the stromal vascular fraction by centrifugation and labelled the resulting cells with 
specific cell surface markers for collection of the CD45+, endothelial, and
progenitor (adipocyte stem cells plus preadipocytes) cell populations by FACS as previously
described**>°”. To quantify progenitor cells, we divided the number of progenitors
recovered from the stromal vascular fraction by the total number of cells in the
stromal vascular fraction. Single-cell preparations from wounds were prepared in
a similar manner. The proportion of GFP+ cells in each population was assessed
using flow cytometry as described‘.
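
The progenitor quantification described above is a simple ratio. A minimal sketch follows, assuming that both counts come from the same stromal vascular fraction preparation; the function name and the example counts are hypothetical and are not data from this study.

```python
def progenitor_fraction(progenitor_count: int, total_svf_cells: int) -> float:
    """Fraction of stromal vascular fraction (SVF) cells that are progenitors.

    Implements the ratio stated in the text: progenitors recovered from
    the SVF divided by the total number of cells in the SVF.
    """
    if total_svf_cells <= 0:
        raise ValueError("total SVF cell count must be positive")
    if not 0 <= progenitor_count <= total_svf_cells:
        raise ValueError("progenitor count must lie between 0 and the total")
    return progenitor_count / total_svf_cells


if __name__ == "__main__":
    # Invented counts, for illustration only.
    print(progenitor_fraction(250, 1000))
```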

Analysis of age-related phenotypes. Bi-weekly checks for cataracts were per- 
formed as described?. iWAT cell diameter, adipose depot weight, skin thickness, 
and skeletal muscle diameter measurements were as described‘. Open-field
testing was performed in a 50 cm × 50 cm box and analysed with TopScan
software (CleverSys Inc.). Sclerotic glomeruli and blood urea nitrogen assessments
were performed as described*®. To determine cardiomyocyte cross-sectional 
area, three sagittal sections of the mid-ventricle separated by 400 µm were stained
with periodic acid-Schiff using a kit (Poly Scientific R&D, k098). At least 150 
cardiomyocytes in the left ventricular free wall per section were measured using 
Image J software. Lethal isoproterenol challenges were performed as described**. 
Parameters of cardiac function were determined with echocardiography using 
Vevo2100 Imaging System (VisualSonics Inc.) with a 30 MHz transducer. Mice
were lightly anaesthetized by isoflurane (0.8-1.5%) with minimal effect on heart
rate. Several m-mode recordings of the cardiac parasternal short axis at the level 
of papillary muscles were made and subsequently analysed. For non-lethal iso- 
proterenol challenge, mice were analysed at baseline (day 0) and subsequently 
were administered five consecutive days of treatment with 10 mg kg⁻¹ isoproterenol twice daily
as described*’. Mice were then assessed for alteration in LV mass on day 6. For 
longitudinal measurements of cardiac structure recordings were acquired in con- 
scious animals. At least three different recordings per animal at each time point 
were analysed. Rotarod assessments were as previously published””. Exercise ability
was assessed as described‘. Novel object discrimination tests were performed as
described’. Grip strength was determined as previously published”. Gross hae- 
matology was assessed by analysing EDTA-treated whole blood using a Hemavet 
950 (Drew Scientific Inc.). FACS-based analyses of peripheral blood leukocyte 
populations were as previously described*?. CD3* T lymphocytes were collected 
from peripheral blood (isolated using CD3 MicroBeads and MACS columns pur- 
chased from Miltenyi Biotec) according to manufacturer's instructions. Glucose 
tolerance was assessed in mice fasted overnight by measuring blood glucose levels 
before and at time points after an intraperitoneal injection of glucose (1.0 g kg⁻¹).
Insulin sensitivity was measured following a 4 h fast by measuring blood glu- 
cose levels before and at time points after an intraperitoneal injection of insulin 
(0.6 mU g⁻¹). Serum Igf1 concentrations were measured by ELISA according to
manufacturer's instructions (R&D Systems, MG100). Wound healing assessments 
were performed as previously described“*. Phosphotungstic acid haematoxylin 
(PTAH) staining on formalin-fixed, paraffin-embedded tissue samples was 
performed as described previously*. 

Quantitative PCR. RNA extraction, cDNA synthesis and qRT-PCR analysis 
of various mouse tissues and CD3* T lymphocytes collected from peripheral 
blood were performed as described’. RNA extraction from cell lines was per- 
formed according to manufacturer’s instructions (Qiagen RNeasy mini kit). 
Sequences of primers for FKBP-Casp8, Egfp, Ink4a, Arf, p21, Il6, Pai1, Igfbp2
and Mmp13 were as previously published’. In addition, the following primers
were used:
Mmp3 forward 5′-CCTGATGTTGGTGGCTTCA-3′, reverse 5′-TCCTGTAGGTGATGTGGGATTTC-3′;
Pparg forward 5′-TCTTCCATCACGGAGAGGTC-3′, reverse 5′-GATGCACTGCCTATGAGCAC-3′;
Cebpa forward 5′-TTGTTTGGCTTTATCTCGGC-3′, reverse 5′-CCAAGAAGTCGGTGGACAAG-3′;
Il1a forward 5′-TCAACCAAACTATATATATCAGGATGTGG-3′, reverse 5′-CGAGTAGGCATACATGTCAAATTTTAC-3′;
Tnfa forward 5′-AGGGTCTGGGCCATAGAACT-3′, reverse 5′-CAGCCTCTTCTCATTCCTGC-3′;
Agtr1a forward 5′-AAGGGCCATTTTGCTTTTCT-3′, reverse 5′-AACTCACAGCAACCCTCCAA-3′;
Sur2a forward 5′-CAATGGAGGCTGTCAGAACA-3′, reverse 5′-GGAGCTGACAGACACGAACA-3′;
and Igfbp5 forward 5′-ATACAACCCAGAACGCCAGCT-3′, reverse 5′-ACCTGGGCTATGCACTTGATG-3′.

SA-β-Gal staining, electron microscopy and X-ray elemental analysis. For SA-β-Gal
staining on kidney cryosections, tissue was embedded in OCT, sectioned at
10 µm thickness, and air-dried. After rehydration in PBS, staining was performed
using the Cell Signaling kit (9860) with a 12-min fixation followed by incubating
in the staining solution at 37°C for 12 h. Whole-mount SA-β-Gal stainings on fat,
kidney and heart were performed using a kit (Cell Signaling), with a 20-min fixation
followed by immersing samples in the staining solution at 37°C for 12 h (iWAT
and eWAT) or 48 h (heart and kidney). Detection of X-Gal crystals by transmission
electron microscopy was as previously described”. Specifically, X-Gal-containing
whole mounts were fixed in Trump’s fixative at 4°C for 12 h and routinely processed
for OsO4/lead staining. Images were acquired and quantification performed on
a Jeol 1400+ electron microscope with 80 kV acceleration voltage. The authenticity
of X-Gal crystals was confirmed by X-ray elemental analysis, whereas the
specificity of crystal formation was verified using fat and heart tissue that was not
stained for SA-β-Gal. To measure the percentage of X-Gal-crystal-containing cells
in kidney sections of AP- or vehicle-treated mice, we screened 5 grids (>1,500 cells
per grid) per treatment group for X-Gal-positive cells at 3,000× magnification.
Cells with one or more crystals and the total number of cells were counted. iWAT
and the visceral pericardium were assessed similarly, with cells with one or more
crystals considered to be X-Gal positive. For iWAT, we only observed crystals in
stromovascular cells with morphology and perivascular location consistent with
adipocyte progenitors and therefore expressed X-Gal positivity as a fraction of total cells.
More than 500 total fat cells per animal were scored for iWAT (n = 4 animals
per group). The total number of cells was estimated by extrapolating from the
cellularity of five 1,500× fields. Owing to dissection and processing, the serous
pericardium was lost for most samples, so the visceral pericardium alone was scored
via TEM. Only ciliated epithelial cells stained positive for SA-β-Gal; therefore, we
expressed the X-Gal positivity of this tissue as a percentage of this cell type. The
entire visible pericardium on one TEM grid per animal was scored, with four animals
per group. For quantitation of SA-β-Gal-positive cells in GFP+ and GFP− cell fractions
collected from iWAT, 100 cells were scored per fraction (n = 3 animals).
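
The extrapolation step for iWAT can be sketched as follows. The text states only that total cell number was extrapolated from the cellularity of five 1,500× fields; the scaling by a number of fields covering the scored area, the function names and the example counts are all assumptions introduced here for illustration.

```python
from statistics import mean


def estimate_total_cells(cells_per_field: list[int], fields_in_scored_area: float) -> float:
    """Extrapolate a total cell count from a few counted fields.

    Assumption: total = mean cells per counted field x number of
    equally sized fields covering the scored area. The exact procedure
    used in the study is not given in the text.
    """
    if not cells_per_field:
        raise ValueError("at least one counted field is required")
    if fields_in_scored_area <= 0:
        raise ValueError("number of fields must be positive")
    return mean(cells_per_field) * fields_in_scored_area


def percent_xgal_positive(positive_cells: int, total_cells: float) -> float:
    """Percentage of cells containing one or more X-Gal crystals."""
    if total_cells <= 0:
        raise ValueError("total cell count must be positive")
    return 100.0 * positive_cells / total_cells


if __name__ == "__main__":
    # Invented counts from five fields, for illustration only.
    total = estimate_total_cells([100, 110, 90, 105, 95], fields_in_scored_area=12)
    print(total, percent_xgal_positive(24, total))
```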

Tissue immunofluorescence staining and western blotting. Paraffin-embedded
kidney tissue sections (5 µm) were rehydrated (2 × 10 min xylene; 2 × 5 min 100%
ethanol; 1 × 5 min 95% ethanol; 1 × 5 min 70% ethanol), incubated in protease
from Streptomyces griseus (Sigma, P6911; 0.75 mg ml⁻¹ in Tris buffer, 0.05 M,
pH 7.6) for 1 h at 37°C to retrieve antigens, blocked with Rodent Block M (BioCare
Medical) for 30 min, and incubated with rabbit anti-Agtr1a antibody (Novus Bio,
NBP1-77078; 1:250) in antibody diluent (Dako) for 12 h at 4°C. Western blotting
on tissue lysates was performed according to standard methods, incubating the
nitrocellulose membrane overnight at 4°C with the following antibodies: Agtr1a
(as above; 1:750), S6K (Cell Signaling, 2708; 1:1,000), phospho-S6K (Cell Signaling,
9234; 1:1,000), AKT (Cell Signaling, 9272; 1:1,000), and phospho-AKT(S473) (Cell
Signaling, 9271; 1:1,000). Quantification of p-S6K/S6K and p-AKT(S473)/AKT ratios
was performed as described*®*”.

Cell culture. ATTAC MEFs were generated as previously described’. Western 
blotting for p16Ink4a and mouse anti-actin was performed as described“*. Growth
curves were generated as previously described**. RNA extraction was done with
a Micro RNeasy kit according to the manufacturer’s instructions (Qiagen). Early passage
p16-3MR MEFs° were provided by J. Campisi and M. Demaria. Early passage
homozygous p16-FLUC MEFs"? were provided by C. Burd. MEFs were immortalized
by expression of SV40 large T antigen as described*’. The cell line genotypes
were authenticated by PCR genotyping. They were not tested for mycoplasma 
contamination. 
Extended Data Figure 1 | ATTAC transgene expression tracks with
expression of senescence markers in iWAT and induces apoptosis
of senescent cells after AP administration. a, Comparative analysis
of SA-β-Gal activity in intact iWAT. Scale bar, 0.5 cm. b, Analysis of
endogenous Ink4a and ATTAC transcripts in iWAT by
qRT-PCR. H/H denotes BubR1H/H mice (n = 4 mice per group). c, FACS-
based quantification of iWAT progenitor cell numbers in 18-month-old
ATTAC mice treated with vehicle or AP. ASC, adipocyte stem cells; PAC,
preadipocytes. d, Expression of the ATTAC transgene and senescence
markers in iWAT as determined by qRT-PCR (n = 4 mice per group).
Asterisks above individual bars denote significant changes to 2-month-old
mice; asterisks above brackets denote significant differences between
18-month-old vehicle and AP-treated mice. e, Perirenal, mesenteric,
subscapular and brown adipose tissue depot weights. SSAT, subscapular
adipose tissue. f, SA-β-Gal activity in iWAT from 2-month-old ATTAC
mice treated with vehicle or AP beginning at weaning age. g, p16Ink4a levels
in iWAT from the mice described in f. Actin was used as a loading control.
h, Expression of ATTAC and senescence marker mRNA in the mice
described in f (n = 3 mice per group). i-k, Early passage non-senescent
ATTAC MEFs express p16Ink4a but are not susceptible to FKBP-Casp8-
mediated elimination when cultured in the presence of AP. i, Levels of
p16Ink4a in passage 3 ATTAC MEFs, with and without AP treatment.
j, Growth curves of passage 3 ATTAC MEFs (n = 4 independently
generated MEF lines per group), with or without AP treatment.
k, Expression of ATTAC and senescence marker mRNA in passage 3
ATTAC MEFs (n = 3 independently generated MEF lines per group), with
or without AP treatment. Error bars indicate s.e.m. *P < 0.05; **P < 0.01;
***P < 0.001 (unpaired two-tailed t-tests). For gel source data, see
Supplementary Fig. 1.


© 2016 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 2 | ATTAC lacks promoter elements required
for expression in replication-competent cells or aged lymphocytes
expressing high levels of endogenous p16Ink4a. a-c, SV40 large-T-
antigen-immortalized ATTAC MEFs robustly express endogenous p16Ink4a
(owing to SV40 large-T-antigen-mediated inactivation of Rb) but fail to
engage the ATTAC transgene and are not subject to FKBP-Casp8-mediated
elimination. a, p16Ink4a protein levels in passage 4 (P4) primary MEFs
and MEFs immortalized with SV40 large T antigen. Actin was used as a
loading control. b, p16Ink4a protein levels in immortalized MEFs treated
with vehicle or two concentrations of AP. Actin was used as a loading
control. c, Expression of ATTAC and senescence marker transcripts in
passage 4 primary MEFs, vehicle-treated immortalized MEFs, and
AP-treated immortalized MEFs (n = 3 independently generated MEF lines
per group). d, Schematic representation of the endogenous Ink4a locus
and the various Ink4a promoter regions driving ATTAC, 3MR and firefly
luciferase (FLUC). ATTAC and p16-3MR mice have 2.6 kb and ~50 kb
Ink4a promoter fragments driving transgene activity, respectively.
p16-FLUC has firefly luciferase knocked into the endogenous Ink4a locus,
which keeps the entire promoter region intact but ablates p16Ink4a protein
expression. e, p16Ink4a protein levels in early passage primary and SV40
large-T-antigen-immortalized p16-3MR MEFs. f, Expression of senescence
marker mRNA in early and late passage primary MEFs and SV40 large-
T-antigen-immortalized p16-3MR MEFs (n = 1 independently generated
MEF line per group performed in triplicate). g, Expression of senescence
marker mRNA in early and late passage primary MEFs and SV40 large-
T-antigen-immortalized p16-FLUC MEFs (n = 1 independently generated
MEF line per group performed in triplicate). h, Expression of ATTAC
and senescence markers in CD3+ T cells from 12- and 18-month-old
ATTAC mice (n = 5 mice per group). Error bars indicate s.e.m. *P < 0.05;
**P < 0.01; ***P < 0.001 (unpaired two-tailed t-tests). We note that all
values in f and g have P < 0.05 compared to passage 4 MEFs, with the
exception of the one marked NS for not significant. For gel source data,
see Supplementary Fig. 1.
Extended Data Figure 3 | ATTAC-mediated clearance of senescent cells
is partial and tissue selective and attenuates expression of inflammation
markers. a, Expression of the ATTAC transgene and a senescence
marker panel, as determined by qRT-PCR, in gastrocnemius, eye, kidney,
heart (atria), spleen, lung, liver and colon (n = 4 females per group).
b, Expression of Il6, Il1a and Tnfa as determined by qRT-PCR in mouse
iWAT, kidney and skeletal muscle at different ages (n = 4 females per
group). Il6 values are as indicated in Extended Data Fig. 1d (iWAT) and
in a (kidney and gastrocnemius). Expression levels of inflammation markers
in unmanipulated 18-month-old C57BL/6 females suggest that repeated
vehicle injections were not a source of tissue inflammation. Error bars
indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (unpaired two-tailed
t-tests). Asterisks above individual bars in a denote significant changes to
2-month-old mice; asterisks above brackets denote significant differences
between 18-month-old vehicle and AP-treated mice.
Extended Data Figure 4 | Comparison of lifespans under different diets
and housing facilities. a, Survival curves of unmanipulated wild-type
C57BL/6-129Sv mice fed a 5% versus 9% fat diet. Median lifespans (in days)
are indicated. b, Survival curves of unmanipulated wild-type C57BL/6-129Sv
mice fed a 9% fat diet plotted against those of vehicle-treated (−AP)
and AP-treated (+AP) C57BL/6-129Sv-FVB ATTAC mice from Fig. 2b.
These data suggest that the lifespans of vehicle-injected C57BL/6-129Sv-
FVB control mice were quite normal for the diet that they were on, and
unlikely to be negatively affected by repeated intraperitoneal injections.
*P<0.01; **P < 0.001; ***P < 0.001 (log-rank tests). c, d, Median survival
data of unmanipulated C57BL/6 male (c) and female (d) mice from various
laboratories for comparison to the results obtained from our facility.

C57BL/6 male studies:
1. This study (vehicle)
2. Sohal et al. 1994 (ref. 24)
3. Conti et al. 2006 (ref. 18)
4. Coschigano et al. 2003 (ref. 19)
5. Goodrick et al. 1990 (ref. 15)
6. Selman et al. 2008 (ref. 23)
7. Forster et al. 2003 (ref. 20)
8. Turturro et al. 1999 (ref. 26)
9. Blackwell et al. 1995 (ref. 16)
10. Yuan et al. 2012 (ref. 27)
11. Bronson and Lipman 1991 (ref. 17)
12. Ikeno et al. 2005 (ref. 21)

C57BL/6 female studies:
1. Conti et al. 2006 (ref. 18)
2. Selman et al. 2008 (ref. 23)
3. Our study (vehicle)
4. Turturro et al. 1999 (ref. 26)
5. Coschigano et al. 2003 (ref. 19)
6. Kunstyr et al. 1975 (ref. 22)
7. Srivastava et al. 1991 (ref. 25)
8. Yuan et al. 2012 (ref. 27)
9. Zhang et al. 2012 (ref. 28)
Extended Data Figure 5 | Senescent cell clearance delays tumour
and cataract formation. a, b, Survival curves of mixed (a) and C57BL/6
(b) ATTAC mice dying of cancer (mice that had an overt tumour at time
of death; only mice with lymphomas, sarcomas and carcinomas were
included). Median survival (in days) and percentage increase are indicated.
c, d, Incidence of macroscopically detectable neoplasms (lymphomas,
sarcomas and carcinomas) at time of death in mixed (c) and C57BL/6
(d) ATTAC mice from survival cohorts. e, Survival curves of
C57BL/6-129Sv-FVB mice dying without cancer (mice that had an overt
tumour at time of death, including lymphoma, sarcoma and/or carcinoma,
were excluded). Median survival (in days) and percentage increase are
indicated. f, g, Cataract incidence for mixed (f) and C57BL/6 (g) ATTAC
mouse cohorts. *P < 0.05; **P < 0.01; ***P < 0.001 (log-rank tests).
Extended Data Figure 6 | Senescent cell clearance does not affect
coordination, memory or exercise ability of 18-month-old ATTAC
mice. a, Time spent balanced during a fixed speed rotarod test for
18-month-old ATTAC mice (n = 6 male and 8 female mice per group).
b, Novel object investigation test. The percentage of investigations of a
novel object divided by the total investigations is graphed. Key and animal
numbers are as indicated in a. c-e, Time-to-exhaustion (c), distance (d)
and work (e) during a treadmill exercise test. Animal numbers are as
indicated in c. f, Gastrocnemius muscle weight of ATTAC mice (n = 6
12-month-old males and females; n = 4 18-month-old −AP males and
females; n = 4 18-month-old +AP males and females). g-i, Myofibre
diameter measurements on isolated gastrocnemius (g), abdominal (h) and
paraspinal muscle (i). Animal numbers are as indicated in f. j, Analysis of
forelimb grip strength of ATTAC mice. Error bars indicate s.e.m. *P < 0.05;
**P < 0.01; ***P < 0.001 (unpaired two-tailed t-tests).
Extended Data Figure 7 | Senescent cell clearance has no effect on
haematological parameters and age-related changes in leukocyte
populations. a-l, Haematology results of 3- and 10-month-old untreated
ATTAC C57BL/6 mice and 18-month-old vehicle- and AP-treated ATTAC
C57BL/6 mice. White blood cell count (a), platelet count (b), red blood
cell count (c), haemoglobin concentration (d), haematocrit (e), mean
corpuscular volume (f), mean corpuscular haemoglobin (g), neutrophils (h),
lymphocytes (i), basophils (j), monocytes (k) and eosinophils (l).
m-q, Assessment for leukocyte subpopulations in 3- and 10-month-old
untreated ATTAC C57BL/6 mice and 18-month-old vehicle- and AP-treated
ATTAC C57BL/6 mice. CD4+ T cells (percentage of peripheral blood
mononuclear cells, PBMC) (m), CD8+ T cells (percentage of PBMC) (n),
CD44hi CD4+ T cells (percentage of CD4+) (o), CD44hi CD8+ T cells
(percentage of CD8+) (p), and NK1.1+ cells (percentage of PBMC) (q).
Error bars indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001 (unpaired
two-tailed t-tests).
Extended Data Figure 8 | Senescent cell removal does not affect
somatotrophic axis signalling in vivo. a, Glucose levels following
intraperitoneal glucose administration after an overnight fast in
18-month-old vehicle- and AP-treated ATTAC C57BL/6 females.
b, Normalized glucose levels after intraperitoneal insulin administration
following a 4-h fast in 18-month-old vehicle- and AP-treated ATTAC
C57BL/6 females. c, Serum Igf1 levels in ATTAC C57BL/6 mice (n = 4
mice of each group). d, Representative western blots for phospho-S6K,
total S6K, phospho-AKT(S473) and total AKT in iWAT, kidney and skeletal
muscle tissue lysates from 18-month-old vehicle- and AP-treated ATTAC
C57BL/6 females. e, Quantification of phospho-S6K to total S6K
ratio in blots from d, n = 4 mice of each group. f, Quantification of
phospho-AKT(S473) to total AKT ratio in blots from d, n = 4 mice of each
group. Error bars indicate s.e.m. No statistically significant differences
were observed in a-c, e and f using unpaired two-tailed t-tests. For gel
source data, see Supplementary Fig. 1.
Extended Data Figure 9 | Senescent cell clearance does not alter cardiac
morphology and function in ‘resting’ mice and AP treatment has no
effect on healthspan of mice lacking the ATTAC transgene. a, Electron
micrographs of X-Gal crystal containing cells in the aortic root. VSMC,
vascular smooth muscle cell. Scale bars, 1 μm (main panel) and 200 nm
(inset). b-g, Echocardiography measurements of heart rate (b), left
ventricular mass (c), posterior wall thickness (d), left ventricular inner
diameter (e), ejection fraction (f), and the fractional shortening of the
heart (g) in 12-month-old untreated mice and 18-month-old ATTAC mice
treated with vehicle or AP. h, Fat mass (n = 9 mice per group). 18-month-old
ATTAC vehicle-treated mouse values are the same as indicated in
Fig. 1. i, iWAT and eWAT depot weight (n = 4 mice per group). 18-month-old
ATTAC vehicle-treated mouse values are the same as indicated in
Fig. 1. j, k, Kidney sclerosis (j) and blood urea levels (k) (n = 4 mice per
group). 18-month-old ATTAC vehicle-treated mouse values are the same
as indicated in Fig. 4. l, Time to death after isoproterenol administration
(n = 4 mice per group). 18-month-old ATTAC vehicle-treated mouse
values are the same as indicated in Fig. 5. Error bars indicate s.e.m.
No statistically significant differences were observed using unpaired
two-tailed t-tests.
Extended Data Figure 10 | Effect of senescent cell clearance on wound
healing and tissue fibrosis. a, Closure of 3-mm punch biopsy wounds in
18-month-old ATTAC females after treatment with vehicle or AP for
6 months and if drug treatment was stopped 2 days before skin puncture or
continued during wound closure (n = 6 wounds for −AP;−AP and +AP;−AP
and n = 10 wounds for −AP;+AP and +AP;+AP). AP administration
during the wound healing process significantly attenuates the rate of
wound closure independently of whether senescent cell removal had
occurred before wounding. b, Closure of 3-mm punch biopsy wounds in
4-month-old ATTAC females after treatment with vehicle or AP following
wounding (n = 10 wounds per group). Similar to 18-month-old mice, AP
administration during the wound healing process dramatically attenuated
the rate of wound closure. c, Quantification of total GFP+ cells isolated
from 3-mm punch biopsy wounds of 4-month-old mice two days into the
wound healing process treated with vehicle (black) or AP (red, n = 3 mice
per group). d, PTAH-stained tissue sections from 18-month-old ATTAC
mice for detection of fibrosis. Scale bars, 100 μm. Error bars indicate s.e.m.
Mice receiving AP during the healing process in a and b are significantly
different from those treated with vehicle from day 1.5 to day 9.5. *P < 0.05
(unpaired two-tailed t-tests).
Laser-plasma accelerators (LPAs) are capable of accelerating charged
particles to very high energies in very compact structures. In theory,
therefore, they offer advantages over conventional, large-scale
particle accelerators. However, the energy gain in a single-stage LPA
can be limited by laser diffraction, dephasing, electron-beam loading
and laser-energy depletion. The problem of laser diffraction can
be addressed by using laser-pulse guiding and preformed plasma
waveguides to maintain the required laser intensity over distances of
many Rayleigh lengths; dephasing can be mitigated by longitudinal
tailoring of the plasma density; and beam loading can be controlled
by proper shaping of the electron beam. To increase the beam energy
further, it is necessary to tackle the problem of the depletion of laser
energy, by sequencing the accelerator into stages, each powered by
a separate laser pulse. Here, we present results from an experiment
that demonstrates such staging. Two LPA stages were coupled over
a short distance (as is needed to preserve the average acceleration
gradient) by a plasma mirror. Stable electron beams from a first
LPA were focused to a twenty-micrometre radius, by a discharge
capillary-based active plasma lens, into a second LPA, such that
the beams interacted with the wakefield excited by a separate laser.
Staged acceleration by the wakefield of the second stage is detected
via an energy gain of 100 megaelectronvolts for a subset of the
electron beam. Changing the arrival time of the electron beam with
respect to the second-stage laser pulse allowed us to reconstruct the
temporal wakefield structure and to determine the plasma density.
Our results indicate that the fundamental limitation to energy
gain presented by laser depletion can be overcome by using staged
acceleration, suggesting a way of reaching the electron energies
required for collider applications.

The limitations of conventional particle-accelerator technology are
motivating the development of advanced particle-acceleration techniques,
such as laser-plasma acceleration, which could see a broad range of
applications, from particle colliders that produce energies beyond
$10^{12}$ electronvolts (TeV) to compact free-electron lasers and Thomson
γ-ray sources. Within the past few years, tremendous progress in LPA
development has been made. After the first demonstration of per-cent-level
energy spread and small divergence in millimetre-scale plasmas in 2004
(refs 11-13), electron beams with energies of $10^{9}$ eV (GeV) were
obtained with 40-terawatt laser pulses (ref. 14). Subsequently, electron
beams with multi-GeV energies were reported with petawatt-class laser
systems and plasmas of a few centimetres in length (refs 15-17).
Controlling the injection of electrons into plasma waves enables the
accelerator to be precisely tuned (refs 18-21).

The accelerating gradient, $E_z$, of a single-stage LPA scales with the
plasma density, $n_e$, as $E_z \propto n_e^{1/2}$. The single-stage length,
$L_{\mathrm{stage}}$, is given by the laser-depletion length,
$L_{\mathrm{deplete}}$: $L_{\mathrm{stage}} \approx L_{\mathrm{deplete}}
\propto n_e^{-3/2}$. Thus, the energy gain per stage scales as
$W_{\mathrm{stage}} \propto 1/n_e$. With the help of particle-in-cell
simulations, it has been shown that, in order to reach an energy of 1 TeV
in a single stage, a plasma density of about $10^{15}$ cm$^{-3}$ is
required. This would result in an acceleration length of 1 km, a low
acceleration gradient, 10 kJ of required laser pulse energy, and an
electron bunch that is not suitable for collider applications. However,
staging using multiple petawatt laser systems would allow for the use
of much higher plasma densities, and hence the generation of higher
accelerating gradients; this would result in a reduction in the total
LPA-based linear-accelerator length to a few hundred metres, as well
as more favourable laser parameters and electron-bunch charges. To
obtain such a compact set-up, coupling distances of the order of the
laser-depletion length, at the 1-metre scale, are assumed. Because the
fluence restrictions of conventional laser optics require them to be
positioned several metres away from the focal plane of the laser, plasma
mirrors have been proposed instead as the final steering optics.
Such a compact staging set-up is also important to photon sources (for
example when using γ-rays to inspect materials), where it can be
used to decelerate electrons after photon production to mitigate
shielding needs.
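
The density scalings quoted above can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function name and the choice to express everything as ratios to an arbitrary reference stage are assumptions made here, and only the three proportionalities stated in the text are used.

```python
def stage_scalings(density_ratio):
    """Return (gradient, stage length, energy gain) relative to a reference stage.

    density_ratio is n_e / n_ref. Only the proportionalities quoted in the
    text are used: E_z ~ n_e**0.5 and L_stage ~ n_e**-1.5.
    """
    gradient = density_ratio ** 0.5
    length = density_ratio ** -1.5
    energy_gain = gradient * length  # W = E_z * L, which reduces to 1 / density_ratio
    return gradient, length, energy_gain


# Lowering the density a thousandfold raises the single-stage energy gain a
# thousandfold, but at the cost of a much longer stage and a weaker gradient.
g, l, w = stage_scalings(1e-3)
print(f"gradient x{g:.3g}, length x{l:.3g}, energy gain x{w:.3g}")
```

This is why a single TeV-class stage ends up long and low-gradient, whereas staging allows each module to run at a higher density.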

Here we demonstrate coupling of, and acceleration in, two sepa- 
rately powered LPA stages. Two synchronized laser pulses were applied 
to drive two acceleration stages in series (Fig. 1). The first stage gen- 
erated electron beams from a gas-jet target with a central energy of 
120 MeV (see Methods). To maximize the coupling efficiency to the 
second stage, these electron beams were refocused by a first discharge 
capillary, acting as an active plasma lens (ref. 8), to the entrance of a second
discharge capillary, serving as the second-stage target. The acceler- 
ation fields in the second capillary were excited by the second laser 
pulse, reflected by a tape-based plasma-mirror a few centimetres away. 
Depending upon the relative timing of the two laser pulses, an energy 
gain of about 100 MeV might be observed, with a charge-coupling 
efficiency of 3.5%. Continuous scanning of the relative timing of the 
laser pulses allowed us to reconstruct the femtosecond-scale temporal 
field structure of the second-stage wake, providing an important wake 
diagnostic. Numerical modelling confirms the effective trapping of the 
electron beam in the second-stage wake structure, and provides evi- 
dence for the femtosecond duration of the first-stage electron beams. 

The electron beams generated in the first stage were transported to
the second-stage target using a pulsed active plasma lens (ref. 8). Radially
symmetrical focusing was achieved in a gas-filled, 15-mm-long capillary
with a diameter of 500 μm, using an axial discharge current of 650 A,
which produced an azimuthal focusing magnetic field. The high field
strengths produced (~0.5 tesla) refocused electrons with energies of
75-125 MeV within a distance of 25 mm through the plasma-mirror tape to an
energy-dependent spot size of 20-30 μm (r.m.s.) at the second plasma stage
(Fig. 1 inset). The divergence acceptance of the lens was 5 mrad.
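
As a rough consistency check on the quoted field strength, the azimuthal field of the lens can be estimated from Ampère's law. The sketch assumes a uniform current density across the capillary, which is an idealization introduced here and not a statement from the text; the function and variable names are illustrative.

```python
import math

MU_0 = 4e-7 * math.pi  # vacuum permeability in T m / A


def lens_field(current_a, capillary_radius_m, r_m):
    """Azimuthal field at radius r inside a uniform axial current:
    B = mu0 * I * r / (2 * pi * R**2)."""
    return MU_0 * current_a * r_m / (2 * math.pi * capillary_radius_m ** 2)


radius = 250e-6   # half of the 500 um capillary diameter
current = 650.0   # peak discharge current in A

b_wall = lens_field(current, radius, radius)   # field at the capillary wall
field_gradient = b_wall / radius               # linear focusing gradient in T/m
print(f"B at wall = {b_wall:.2f} T, gradient = {field_gradient:.0f} T/m")
```

Under this assumption the field at the wall comes out near the ~0.5 tesla quoted above.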

The second-stage LPA target was formed by a separate discharge 
capillary structure (see Methods). The discharge current created a 
pre-formed plasma that served as a waveguide, guiding the driving 
Figure 1 | The experimental set-up. In stage I, a pulse of laser light is
focused on a gas jet, producing an electron beam. This beam is then
transported to the entrance of stage II by a discharge capillary, which
is acting as an active plasma lens. In stage II, the beam enters a second
discharge capillary. A second laser pulse further accelerates the electrons;
this laser is coupled to the second discharge capillary via a plasma-mirror
tape. Lanex screens are used to detect the energy-integrated and


laser pulse over many Rayleigh lengths, minimizing diffraction and
extending the acceleration length. These target systems are well
characterized, and a model has previously been developed that permits
the wakefield amplitude to be determined by means of the spectral
redshift of the transmitted laser. We used a feedback-controlled,
tape-based plasma mirror (see Methods) to combine the injected
electron beam with the laser driver in the second stage.

The laser pulses reflected off the plasma mirror were guided in the
parabolic plasma channel created in the discharge capillary with an
energy transmission of 85%. Matched propagation of a transversely
Gaussian laser pulse in a plasma with a transverse parabolic density
profile can be obtained, at low laser power and intensity, if the
input-laser spot size, $w_0$ (which corresponds to a radius whereby
the laser intensity is $1/e^2$ compared to the on-axis value), equals the
matched spot size, $r_m$. (For a parabolic plasma profile,
$n(r) = n_0 + \alpha r^2$, where $n_0$ is the on-axis density, $r$ is the
transverse spatial coordinate in the plasma channel, and $\alpha$ is the
parameter controlling the depth of the channel; the matched spot size is
given by $r_m = (\alpha \pi r_e)^{-1/4}$, with $r_e = 2.8 \times 10^{-13}$ cm
being the classical electron radius.) In our experimental conditions,
$r_m = 45$ μm, and the laser spot size at focus was $w_0 = 18$ μm, leading
to mismatched propagation and, hence, to varying peak intensities and
wakefield strengths along the capillary. The characteristic oscillation
length of the laser spot size is given by
$\lambda_{\mathrm{os}} = \pi Z_{Rm}$, where $Z_{Rm} = \pi r_m^2/\lambda$, and
$\lambda = 0.8$ μm is the central wavelength of the laser. For our
parameters, $\lambda_{\mathrm{os}} = 25$ mm. Wake excitation under these
conditions was confirmed by measuring optical spectra of the transmitted
laser pulse, showing an increasing redshift with increasing plasma density
in the channel. Quantitative analysis of the spectra revealed a maximum
relative redshift of 3% with respect to the central wavelength of the laser
at a density of $2 \times 10^{18}$ cm$^{-3}$. This corresponds to an
average field amplitude of about 17 MV mm$^{-1}$ if wake excitation occurs
over the full length of the capillary.
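
The channel parameters in this paragraph can be reproduced numerically. The sketch below assumes the relations $r_m = (\alpha \pi r_e)^{-1/4}$, $Z_{Rm} = \pi r_m^2/\lambda$ and $\lambda_{\mathrm{os}} = \pi Z_{Rm}$ in the form given above; the function name and unit conventions are choices made here for illustration.

```python
import math

R_E_CM = 2.818e-13  # classical electron radius in cm


def matched_channel(r_m_um, wavelength_um):
    """Return (alpha in cm^-5, Z_Rm in mm, lambda_os in mm) for a parabolic
    channel n(r) = n0 + alpha * r**2 with matched spot size r_m."""
    r_m_cm = r_m_um * 1e-4
    alpha = 1.0 / (math.pi * R_E_CM * r_m_cm ** 4)      # inverts r_m = (alpha*pi*r_e)**-0.25
    z_rm_mm = math.pi * r_m_um ** 2 / wavelength_um * 1e-3
    lambda_os_mm = math.pi * z_rm_mm
    return alpha, z_rm_mm, lambda_os_mm


alpha, z_rm, lambda_os = matched_channel(45.0, 0.8)
density_rise = alpha * (45.0e-4) ** 2   # density increase from the axis to r = r_m
print(f"Z_Rm = {z_rm:.2f} mm, lambda_os = {lambda_os:.1f} mm")
print(f"density rise at r_m = {density_rise:.2e} cm^-3")
```

With a 45 μm matched radius and a 0.8 μm wavelength, the oscillation length evaluates to about 25 mm, in line with the value in the text.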

To control the phasing of the electron beam in the plasma wake of 
the second-stage LPA, we varied the delay between the laser pulses 
that drive the first and second stages, with femtosecond precision, 
with an optical-delay stage in the laser beam line of the injector stage. 
Electron spectra were recorded as a function of the delay between the 
two laser pulses. In the case of a positive delay, the first-stage elec- 
trons propagated without the influence of the second laser pulse. After 
the second laser pulse arrived (negative delay), the electron spectra 
were periodically modulated in energy (Fig. 2a). The period of the 
modulation was 80 ± 6 femtoseconds, consistent with a plasma wavelength
$\lambda_p$ of 24 μm, at a density of $(1.9 \pm 0.3) \times 10^{18}$ cm$^{-3}$. The constant
periodicity of the observed modulation as a function of delay behind 
the driver pulse further indicates a quasilinear wake, consistent with 
energy-dispersed (as part of a dipole spectrometer) electron profiles. The
inset shows how the diameter of the waist (the ‘spot size’) of the electron
beam evolves along the beam path (z), simulated for different electron-beam
energies produced by the first stage, according to ref. 8. Energies
in the interval 75-125 MeV are focused at the entrance of the stage II
capillary to spot sizes of the order of the input-laser spot size (18 μm).


expectations for the experimental parameters, including laser intensity 
and plasma density. 
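
The link between the measured modulation period and the plasma density can be sketched as follows. The calculation assumes that the wake moves at close to the speed of light, so that the period in delay maps directly onto the plasma wavelength, and uses the standard relation $n_e = \pi/(r_e \lambda_p^2)$; the function name is illustrative.

```python
import math

C_UM_PER_FS = 0.299792458   # speed of light in micrometres per femtosecond
R_E_CM = 2.818e-13          # classical electron radius in cm


def density_from_period(period_fs):
    """Return (plasma wavelength in um, electron density in cm^-3) for a wake
    whose accelerating phase recurs every period_fs in delay."""
    lambda_p_um = C_UM_PER_FS * period_fs
    lambda_p_cm = lambda_p_um * 1e-4
    n_e = math.pi / (R_E_CM * lambda_p_cm ** 2)
    return lambda_p_um, n_e


lambda_p, n_e = density_from_period(80.0)
# n_e scales as period**-2, so the relative density error is twice that of the period.
n_e_err = n_e * 2 * 6.0 / 80.0
print(f"lambda_p = {lambda_p:.1f} um, n_e = {n_e:.2e} +/- {n_e_err:.1e} cm^-3")
```

An 80 fs period gives a wavelength of about 24 μm and a density close to the quoted value, with the 6 fs uncertainty propagating to roughly 15% in density.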

To investigate the influence of the second-stage wakefield on the 
electron beam in detail, we subtracted the reference spectrum result- 
ing from an unperturbed beam (positive delay) from the spectrum 
at each delay, to emphasize the effect of the second laser pulse while 
maintaining absolute charge information. The resulting electron dis- 
tributions are plotted in Fig. 2b in the form of a waterfall plot of elec- 
tron spectra, where each horizontal line corresponds to an energy 
spectrum that is averaged over five shots. Background-subtracted 
two-dimensional charge maps for the first two peaks and valleys of 
the blue curve in Fig. 2a, also averaged over five shots, are shown 
in Fig. 2d-g. The presence of the second-stage laser results in a 
reduction in total beam charge by up to a factor of three (Fig. 2a). 
For appropriate timing of the second-stage laser, however, charge
was detected beyond the energy cut-off of the input electron spectrum,
that is, >200 MeV. This charge beyond the cut-off of the input spectrum
(red and yellow areas in Fig. 2b, d, f) indicates acceleration in the
second stage. The integrated charge of 1.2 pC in this region represents
the charge trapped in the accelerating phase of the wake, corresponding to
a trapping efficiency of 3.5%. At delays of $\lambda_p/2$ after the times
of maximum energy gain,
roughly 1 pC of additional charge was detected around 110-150 MeV 
(Fig. 2e,g). This could correspond to electrons that have deceler- 
ated, or to electrons that have been deflected by the transverse wake 
fields into the spectrometer acceptance. The broad energy spread of 
the first-stage electron beam prevents unambiguous observation of the 
decelerating phase of the wake under these conditions. 
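
The reference-subtraction procedure described in this paragraph can be sketched in a few lines. Everything in the example is hypothetical: the bin centres, the charge values and the helper names are toy inputs chosen for illustration and are not data from the experiment.

```python
def mean_spectrum(shots):
    """Average several single-shot spectra (lists of charge per energy bin)."""
    n = len(shots)
    return [sum(bin_values) / n for bin_values in zip(*shots)]


def subtract_reference(spectrum, reference):
    """Subtract the unperturbed reference spectrum bin by bin."""
    return [s - r for s, r in zip(spectrum, reference)]


def charge_above(energies_mev, spectrum, cutoff_mev):
    """Sum the charge in all bins whose energy lies above the cut-off."""
    return sum(q for e, q in zip(energies_mev, spectrum) if e > cutoff_mev)


energies = [100, 150, 200, 250]        # MeV, toy bin centres
reference = [10.0, 6.0, 1.0, 0.0]      # pC per bin, toy unperturbed beam
shots = [[4.0, 2.0, 0.5, 0.6],         # toy spectra at one delay setting
         [6.0, 2.0, 0.5, 0.4]]

difference = subtract_reference(mean_spectrum(shots), reference)
gained = charge_above(energies, difference, 200)
print(difference, gained)
```

Negative entries in the difference mark charge lost relative to the reference, while positive charge above the input cut-off is the signature of second-stage acceleration.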

Numerical modelling performed with the code INF&RNO
allows detailed analysis of the interaction. Figure 3a shows reference- 
subtracted electron spectra as a function of the delay between the 
arrival of the electron bunch and the laser pulse. The simulations 
show that the observed energy modulations depend on the phasing 
of the electron bunch within the wake. The periodicity of the modu- 
lation is determined by the plasma density and is consistent with the 
experimental observation. However, the amount of post-accelerated 
charge decreases in the later accelerating phases of the wake as a result 
of increasing wake curvature. The fact that the linearity of the wake 
appears to be preserved in the experimental results could be attributed 
to a deviation from the parabolic plasma channel. We have found that, 
for example, simulating a quartic plasma density profile yields a charge 
distribution similar to that obtained in the experiment (Extended Data 
Fig. 1). Simulations performed assuming matched guiding conditions, 
and a more-energetic injector beam with reduced energy spread, indi- 
cate that roughly 90% trapping can be achieved (Extended Data Fig. 2). 
Figure 2 | Spectra of electron beams produced by staged acceleration.
a, Maximum electron energy (blue) and total electron-beam charge (red) as
a function of the delay between the two driving laser pulses. ‘Positive
delays’ correspond to times before the arrival of laser 2. A single data
point represents an average of five measurements; error bars represent the
standard deviation. b, Waterfall plot of electron spectra (five-shot
average), each with the reference from panel c subtracted, as a function
of delay. c, 100-shot average unperturbed reference for delays of
100-300 fs before the arrival of the second laser pulse. c-g,
Two-dimensional charge maps (five-shot average), with reference (c)
subtracted for the first two maxima and minima of the energy oscillation
shown in a, that is, for
As discussed above, there were two regions of increased laser
intensity (and hence higher wake amplitude) in the capillary, owing
to mismatched laser-pulse guiding. In Fig. 3, we plot the evolution
of electron energy (Fig. 3b) and bunch size (Fig. 3c) as a function of
propagation in the capillary for two different electron populations at
a delay of −252 femtoseconds. Electrons that had an initial energy in
the range 75-125 MeV did not gain a notable amount of energy, and 
Figure 3 | Simulation results. a, Waterfall plot of electron energy
spectra as a function of delay, with the same colour scale as in Fig. 2b.
The reference was subtracted from each spectrum in a similar way as for
the experimental results. b, Black line, evolution of the laser intensity,
expressed as the relativistically normalized laser vector potential $a_0$, as a
function of propagation in the capillary (z denotes the location along the
plasma channel) in the second-stage LPA. Red and blue lines, evolution
of electron energy as a function of propagation in the capillary for a delay
of −252 fs for two different electron populations: electrons with an initial
energy in the interval 75-125 MeV (blue) and electrons with a final energy
were strongly defocused by the transverse wakefield in the early stages
of the laser-plasma interaction. On the other hand, electrons with
a final energy above 200 MeV experienced an energy gain of about
100 MeV in the vicinity of the second laser focus, corresponding to a
propagation distance of z = 24-29 mm in the plasma, where (because
of the focusing induced by the discharge current and the laser-induced
wake) they reach a spot size of roughly 5 μm and interact strongly
of 200-300 MeV (red). c, Evolution of the electron-beam r.m.s. spot size
(also called bunch size) for the same electron-beam subsets as in panel
b, along with the electron-beam r.m.s. spot size of the ‘red’ electrons
without influence of the wakefield (red dashed line). d, Total transverse
force on the electron beam divided by the contribution from the discharge
current F(z)/F(z = 0), for two different distances from the axis: 10 μm
(black) and 20 μm (grey). (F(z = 0) is the contribution from the discharge
current.) The area between the grey dashed lines in b-d indicates where
the electrons with final energies greater than 200 MeV are trapped and
accelerated in the laser-induced wakefield.
with the laser-driven plasma wake. This is shown in Fig. 3c, where we
plot (red solid line) the evolution of the transverse electron bunch size 
as a function of the propagation distance. The red dashed line in Fig. 3c 
shows the evolution of the bunch spot size without the influence of the 
laser pulse, indicating that the contribution of the external magnetic 
field (induced by the discharge current of the second capillary) to the 
trapping of the electrons cannot be neglected. 

To elucidate this, we computed in the simulation the amplitude of the 
transverse force, F(z), acting on the electron bunch at distances of 10 μm
and 20 μm from the capillary axis. The focusing contribution from the
externally applied discharge current is given by F(z=0), where there 
is no wake contribution. We plot the ratio F(z)/F(z=0) in Fig. 3d. For 
propagation distances in the capillary of z < 10 mm, the average spot
size of electrons with final energies of 200-300 MeV is larger than the
laser spot; hence, the external field prevails and focuses the electron
beam. At positions 10 mm < z < 22 mm, the laser intensity is reduced
markedly because of mismatched guiding (Fig. 3b); hence the focusing
force from the externally applied current is of the same order as the
wakefield contribution, leading to continued focusing of the electron
beam. Finally, for positions 24 mm < z < 29 mm, the contribution of
the wakefield dominates the interaction. In this range, a subset of the
electron beam is trapped in the wakefield and gains energy. An energy
gain of about 100 MeV is observed in the simulations. For comparison,
the experiment was repeated with the reversed (that is, defocusing)
current in the second capillary. In this case, the external focusing field that
supports trapping in the region 24 mm < z < 29 mm is absent. Trapping
still occurs in the region of the first laser focus (2 mm < z < 5 mm), but
electrons trapped here subsequently dephase and defocus, and thus
do not experience a detectable energy gain at the end of the second
capillary. The full data set is shown in Extended Data Fig. 3. 

In summary, we have presented an experimental study of the cou- 
pling of two LPA stages that are independently driven by two syn- 
chronized laser pulses. We demonstrated electron-beam injection and 
capture into the second-stage LPA by means of an energy gain of
approximately 100 MeV that recurs at delays corresponding to multiples of
$\lambda_p$. The observation of temporally well defined energy modulations
further implies directly that the bunch length of the input electron
beam is shorter than $\lambda_p/4 \approx 5$ μm. This represents a milestone in the
development of laser-driven, plasma-based accelerators for particle 
colliders, and for any other LPA application that requires electron 
energies beyond the limits of single stages. Simulations of modules 
that provide multi-GeV energy boosts to high-quality electron beams 
by operating at lower densities (that is, with a larger transverse wake 
size) suggest that 100% trapping can be achieved. The demonstration 
of such collider-relevant stages—including synchronizing the timing 
of separate laser drivers—will be the focus of our future work. 
METHODS

Laser pulses with a centre wavelength, λ, of 0.8 μm, produced by the TREX
Ti:sapph laser at the BELLA Center, were split with a MgF2 beamsplitter of
5-mm thickness. For stage I (II), 70% (30%) of the 40-TW laser-pulse energy,
that is, 1.3 J (0.45 J) of laser energy, was focused by a parabola of focal
length 2 m to a beam waist, $w_0$, of 18 μm. This corresponds to a laser
intensity of $4 \times 10^{18}$ W cm$^{-2}$ ($1.4 \times 10^{18}$ W cm$^{-2}$)
at a pulse duration of 45 fs.
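
The two intensities quoted above are related by the ratio of the pulse energies, and each can be converted to a normalized vector potential $a_0$. The sketch assumes linear polarization and uses the common engineering formula $a_0 \approx 0.855\,\lambda[\mu\mathrm{m}]\,\sqrt{I/10^{18}\,\mathrm{W\,cm^{-2}}}$; both the polarization and the function name are assumptions made here, not statements from the Methods.

```python
import math


def normalized_vector_potential(intensity_w_cm2, wavelength_um):
    """a0 for a linearly polarized pulse (assumed):
    a0 ~ 0.855 * wavelength[um] * sqrt(I / 1e18 W cm^-2)."""
    return 0.855 * wavelength_um * math.sqrt(intensity_w_cm2 / 1e18)


intensity_stage1 = 4e18                          # W cm^-2, quoted for the 1.3 J pulse
intensity_stage2 = intensity_stage1 * 0.45 / 1.3 # same focus and duration, lower energy

a0_stage1 = normalized_vector_potential(intensity_stage1, 0.8)
a0_stage2 = normalized_vector_potential(intensity_stage2, 0.8)
print(f"I2 = {intensity_stage2:.2e} W cm^-2, a0 = {a0_stage1:.2f} and {a0_stage2:.2f}")
```

Scaling by the energy ratio recovers the second quoted intensity to within rounding.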

Plasma mirror. A feedback-controlled, VHS-tape-based plasma-mirror system 
was used to combine the injection beam with the laser driver for the second stage. 
The electron beam passed through the tape, while the laser beam was reflected at 
90°. This plasma mirror has a laser-energy throughput of 80% (that is, 480 mJ
laser energy), results in no observable laser mode or pointing degradations, and 
has been fully characterized and improved to provide uninterrupted operation at 
a repetition rate of 1 Hz for hours of run time. 

Electron-beam diagnostic. The energy spectrum of electron bunches emerging 
from the plasma was measured by a magnetic electron spectrometer. It used a 
120mm by 480 mm rectangular-pole electromagnet with the maximum field 
strength of 1.5 T (ref. 30). Deflected electrons went through scintillating screens 
(Lanex, Kodak) that were arranged to optimize the energy resolution, and the
resulting light was imaged by three synchronously triggered charge-coupled-device 
(CCD) cameras, enabling single-shot detection of electrons with energies in the 
range 0.055-0.25 GeV and 0.3-2.1 GeV for 1 T. 

Gas jet. A supersonic deLaval nozzle type gas jet with a diameter of 700 μm was
used as the first-stage target. At backing pressures of 150 psi (with the gas being
a mixture of 99% helium and 1% nitrogen), the neutral density profile exhibited
an approximately Gaussian shape, verified by interferometric measurements.
The plasma density at the interaction point with the laser pulse was
$5 \times 10^{18}$ cm$^{-3}$.

Stage I electron beam. To establish a stable injector stage, 70% of the energy of the
BELLA Center TREX laser pulse was focused onto a supersonic gas jet (see above).
A composition of two gases (99% helium and 1% nitrogen) was used to increase the
amount of trapped charge. In such a mixed gas, injection was achieved by ionizing
deeply bound electrons from a gas with high atomic number (nitrogen) near the
peak of the laser pulse, the proper phase inside the wakefield allowing them to be
trapped. Stable electron beams with a mean energy of 120 ± 5 MeV, a 60% (full
width at half-maximum) energy spread, a beam charge of 33 ± 5 pC, and a low
divergence of 4 ± 0.3 mrad were routinely produced, with a pointing stability of
0.3 mrad (standard deviation) over hours of run time (corresponding to thousands
of laser shots and more than ten days).

Capillaries. The capillary used as the active plasma lens had a diameter of 500 μm
and a length of 15 mm. Hydrogen gas was flowed into the capillary through slots
located 2 mm from each end of the capillary. The temporal profile of the discharge
current was approximately sinusoidal, with a peak current of 650 A. Data were
taken for laser arrival at the peak of the discharge, and allowed focusing of the
100-MeV electrons to the entrance of the accelerator-stage capillary, which was
located 25 mm downstream of the exit of the active plasma lens. The capillary used
as the second stage had a diameter of 250 μm and a length of 33 mm. Hydrogen gas was
flowed into the capillary through slots located 2 mm from each end of the capillary.
The discharge current was approximately sinusoidal, with the peak current being
325 A. Data were taken for laser arrival at the peak of the discharge.


The two discharge capillaries have identical lens strength and form an imaging 
telescope for the electron beam. We verified independently that an electron prop- 
agating through this imaging telescope, without the influence of the second-stage 
laser pulse, has the same final divergence as the input electron beam, in the range 
of 60 to 300 MeV that is relevant for this experiment. 

Numerical modelling. Calculations of the laser-plasma interaction, including
laser and wakefield evolution, and of electron acceleration/modulation were
carried out with the computational framework INF&RNO (integrated fluid and
particle simulation code). INF&RNO is a two-dimensional, cylindrical (r-z) code
that adopts an envelope model for the laser pulse, and makes use of the approxi- 
mation of the ponderomotive force to describe the interaction of the laser pulse 
with the plasma. In the simulations presented here, the background plasma is 
modelled via a (noiseless) cold fluid description, while a fully kinetic description, 
realized through the particle-in-cell approach, is used for the electron bunch. The 
laser envelope is evolved in time using a second-order Crank-Nicolson method.
The full wave operator is retained in the evolution equation for the laser envelope. 
At any given time step, once the laser envelope is specified, the (fluid) wakefield 
is computed using a quasistatic approach. The effect of the discharge current in 
stage II is taken into account by adding a radially varying azimuthal magnetic field 
corresponding to a discharge current of 325 A. Evolution of the electron bunch 
over time is computed using a fourth-order Runge-Kutta integrator. The input 
laser pulse is modelled according to the measured transverse intensity profile of 
the laser. The background plasma profile in stage II has a length of 33 mm, with 
3-mm initial/final ramps and a 27-mm-long longitudinal flat-top section in the 
centre. The transverse density profile is parabolic with a matched radius, $r_m$, of
45 μm. The initial (that is, after the first LPA stage) phase space distribution for
the electron bunch is modelled according to the measured bunch properties (see
the spectrum in Fig. 2c). The initial bunch length was varied in the range 1 μm to
30 μm (longitudinal flat-top profile), while the initial bunch radius was chosen in
the range 0.2-5 μm (transverse flat-top profile). Simulation results are insensitive
to the choice of the initial bunch radius within the aforementioned range. In each 
simulation, the complete transport of the electron bunch from the exit of stage I 
to the entrance of stage II is modelled, including the effect of the active plasma 
lens (powered with a current of 650 A) and the interaction (scattering) with the 
plasma-mirror tape. The final (that is, after stage II) electron spectra have been 
computed taking into account the finite acceptance of the spectrometer. 
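
For readers unfamiliar with the fourth-order Runge-Kutta scheme mentioned above, the following is a generic sketch of one such integrator applied to a toy linear-focusing problem. It is not taken from INF&RNO; the equation of motion, the focusing strength and all names are illustrative assumptions.

```python
import math


def rk4_step(f, z, y, dz):
    """Advance the state vector y by one classical fourth-order Runge-Kutta step."""
    k1 = f(z, y)
    k2 = f(z + dz / 2, [yi + dz / 2 * ki for yi, ki in zip(y, k1)])
    k3 = f(z + dz / 2, [yi + dz / 2 * ki for yi, ki in zip(y, k2)])
    k4 = f(z + dz, [yi + dz * ki for yi, ki in zip(y, k3)])
    return [yi + dz / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]


def linear_focusing(k):
    """x'' = -k**2 * x written as a first-order system in (x, x')."""
    return lambda z, y: [y[1], -k * k * y[0]]


k = 2 * math.pi                    # toy focusing strength: one oscillation per unit length
state, z, dz = [1.0, 0.0], 0.0, 0.001
for _ in range(1000):              # integrate over one full oscillation
    state = rk4_step(linear_focusing(k), z, state, dz)
    z += dz

print(state)  # should return close to the initial condition (1, 0)
```

A particle in a purely linear focusing field returns to its starting offset after one oscillation, which gives a simple check on the integrator.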

Code availability. The INF&RNO code is part of BLAST (Berkeley Lab Accelerator 
Simulation Toolkit). For more information on how to access the code, visit 
http://blast.lbl.gov/.
Extended Data Figure 1 | Simulation results with quartic transverse
plasma density profile. a, Waterfall plot of electron energy spectra as a 
function of delay, with the same colour scale as in Fig. 2b. The reference 
spectrum (that is, the spectrum computed for delays greater than 0, 
without the influence of laser 2) was subtracted from each simulation 
spectrum in a similar way as for the experimental results in Fig. 2b. 

b, The quartic transverse density profile used in the simulation is 


LETTER 


2.50 


nj(r) (x10"°cm?) 
N 
N 
uw 


2.00 


0 20 40 60 80 100 120 140 


n(r) = m9 + car? + $(1 —€)(mar)!/?r4, with ¢=0.7 (red dashed line). 
The chosen form for the quartic profile is such that for any chosen
$\epsilon$ ($0 < \epsilon < 1$), the matched spot size in the limit of a low laser power and
low laser intensity is the same as for the parabolic profile $\epsilon = 1$ (blue line).
$n_0$ is the on-axis density, $r$ is the transverse (radial) coordinate, and $\alpha$ is the
parameter controlling the depth of the plasma channel.

Extended Data Figure 2 | Simulation with optimized laser-plasma
parameters. Waterfall plot of electron energy spectra as a function of
delay. This simulation assumed matched laser-guiding conditions, and a
more-energetic injector beam with reduced energy spread (as compared
with the experimental set-up); the simulation results indicate that roughly
90% trapping of the electron beam can be achieved. The injector electron
beam had a central energy of 350 MeV, a charge of 10 pC, an energy spread
of 6% (r.m.s.) and a divergence of 2.5 mrad (full width at half-maximum).
The laser pulse energy was 1 J. The laser spot size was $w_0 = 40$ μm. The red
line shows the fraction of trapped charge (scale on the right).

Extended Data Figure 3 | Experimental results obtained with a reversed
(defocusing) current direction in the second LPA stage. Main figure,
a waterfall plot of electron spectra (subtracted by the reference) as a
function of delay; the electron charge density is colour coded, and the
total energy of the electron beam is shown as black circles. Inset, the
unperturbed reference beam. Electrons trapped in the region of the first
laser focus will subsequently dephase/defocus under these conditions, and
thus do not experience a detectable energy gain at the end of the second
capillary.

Observing the Rosensweig instability of a quantum ferrofluid


Holger Kadau¹, Matthias Schmitt¹, Matthias Wenzel¹, Clarissa Wink¹, Thomas Maier¹, Igor Ferrier-Barbut¹ & Tilman Pfau¹


Ferrofluids exhibit unusual hydrodynamic effects owing to the 
magnetic nature of their constituents. As magnetization increases, 
a classical ferrofluid undergoes a Rosensweig instability¹ and creates
self-organized, ordered surface structures² or droplet crystals³.
Quantum ferrofluids such as Bose-Einstein condensates with strong
dipolar interactions also display superfluidity⁴. The field of dipolar
quantum gases is motivated by the search for new phases of matter
that break continuous symmetries⁵,⁶. The simultaneous breaking of
continuous symmetries such as the phase invariance in a superfluid 
state and the translational symmetry in a crystal provides the 
basis for these new states of matter. However, interaction-induced 
crystallization in a superfluid has not yet been observed. Here we 
use in situ imaging to directly observe the spontaneous transition 
from an unstructured superfluid to an ordered arrangement of 
droplets in an atomic dysprosium Bose-Einstein condensate⁷. By
using a Feshbach resonance to control the interparticle interactions,
we induce a finite-wavelength instability⁸ and observe discrete
droplets in a triangular structure, the number of which grows as 
the number of atoms increases. We find that these structured states 
are surprisingly long-lived and observe hysteretic behaviour, which 
is typical for a crystallization process and in close analogy to the 
Rosensweig instability. Our system exhibits both superfluidity and, 
as we show here, spontaneous translational symmetry breaking. 
Although our observations do not probe superfluidity in the 
structured states, if the droplets establish a common phase via 
weak links, then our system is a very good candidate for a supersolid 
ground state⁹⁻¹¹.

Research in condensed matter physics is driven by the discovery of 
novel phases of matter, in particular, phases simultaneously display- 
ing different types of order. A prime example is the supersolid state, 
which features crystalline order and superfluidity simultaneously⁹⁻¹¹.
This state has been elusive and claims of its discovery in helium¹²,¹³
have been withdrawn recently¹⁴. One of the requirements for spon-
taneous spatially ordered structure formation in the ground state of a
many-body system is the existence of long-range interactions such as 
those present in ferrofluids. As a consequence of these interactions, a 
magnetized ferrofluid forms stationary surface waves due to competi- 
tion between gravitational or magnetic trapping, dipolar interaction 
and surface tension. This effect is known as the normal-field instabil- 
ity or Rosensweig instability” and leads to stable droplet patterns on a 
superhydrophobic surface’. For ferrofluids, the dispersion relation of 
surface excitations has a minimum at finite momentum, resembling 
the well-studied roton spectrum in liquid helium!>. However, the 
physical interpretation of this minimum is very different for ferro- 
fluids and helium. The origin of the dispersion-relation minimum for 
a ferrofluid is an energy gain due to the attractive part of the dipolar 
interaction resulting in a clustering of polarized dipoles in a head- 
to-tail configuration in periodic structures. Such roton-induced 
structures have also been discussed for quantum ferrofluids*!®. 
In close similarity with a classical ferrofluid, a competition exists 
between the harmonic trapping, dipolar interaction and contact
interaction in a quantum ferrofluid. For increasing relative dipolar
interaction, the roton instability can lead to a periodic perturbation 
of the atomic density distribution, which is closely connected to the 
Rosensweig instability'”. However, it was believed that these rotonic 
structures would be unstable, owing to subsequent instabilities of the 
forming droplets’®. 

Here we cool down the most magnetic element, dysprosium (Dy)’’,
with a magnetic moment of $\mu = 9.93\mu_B$, where $\mu_B$ is the Bohr magneton,
and generate a Bose-Einstein condensate (BEC)’. We observe an angu-
lar roton instability'®!* and find subsequent droplet formation yielding
triangular structures with surprisingly long lifetimes. We use two key 
tools to study these self-organized structures. First, we use a magnetic 
Feshbach resonance” to tune the contact interaction (see Extended 
Data Fig. 1) and to induce the droplet formation. Second, we use a 
microscope with high spatial resolution to detect the atomic density 
distribution in situ. 

The first prediction of structured ground states in a dipolar BEC 
dates back to the early days of quantum gases”; the first mechanical 
effects were seen with chromium atoms”. There, the dipolar attraction 
deforms the compressible gas and its shape is balanced by a repulsive 
contact interaction, described by the scattering length a. To compare 
the strengths of the contact and dipolar interaction, we introduce a 
length scale that characterizes the magnetic dipole-dipole interaction 
strength: $a_{dd} = \mu_0\mu^2 m/(12\pi\hbar^2)$ (ref. 2), where $\mu_0$ is the vacuum perme-
ability, $m$ is the atomic mass and $\hbar$ is the reduced Planck constant. By
tuning the scattering length $a$ with a Feshbach resonance such that
$a < a_{dd}$, the dipolar attraction dominates the repulsive contact inter-
action and an instability of a dipolar gas can occur*”*. However, in a
pancake-shaped trap, the dipoles sit mainly side-by-side and predom- 
inantly repel each other, owing to the anisotropy of the dipole-dipole 
interaction, and hence the dipolar BEC is stabilized. In such a pan- 
cake-shaped configuration, the roton instability, which occurs at a finite 
wavelength, is predicted®!°.
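
The dipolar length defined in this paragraph can be evaluated directly. The following is a minimal sketch, assuming CODATA values for the constants and a ¹⁶⁴Dy atomic mass of about 163.93 u (neither is quoted in the text); the function name `dipolar_length` is illustrative only.

```python
import math

# Physical constants in SI units. These are assumed CODATA figures,
# not values quoted in the text.
MU_0 = 4e-7 * math.pi        # vacuum permeability, T m / A
HBAR = 1.054571817e-34       # reduced Planck constant, J s
MU_B = 9.2740100783e-24      # Bohr magneton, J / T
A_0 = 5.29177210903e-11      # Bohr radius, m
AMU = 1.66053906660e-27      # atomic mass unit, kg


def dipolar_length(mu, mass):
    """Dipolar length a_dd = mu_0 mu^2 m / (12 pi hbar^2), in metres."""
    return MU_0 * mu**2 * mass / (12 * math.pi * HBAR**2)


# Dysprosium: mu = 9.93 mu_B (from the text); the mass of 164Dy is an assumption.
a_dd = dipolar_length(9.93 * MU_B, 163.93 * AMU)
a_dd_in_bohr = a_dd / A_0    # dipolar length in units of the Bohr radius
```

With these inputs the result comes out close to the dipolar length of 132 Bohr radii quoted for the isotope used in the experiment; the small difference depends on the exact magnetic moment and mass assumed.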

For our experiments, we used the isotope ¹⁶⁴Dy with a dipolar
length of $a_{dd} = 132a_0$, where $a_0$ is the Bohr radius. This dipolar length
is already greater than the background scattering length $a_{bg} = 92(8)a_0$
(where the errors in parentheses here and elsewhere represent one 
standard deviation), which is the value far from Feshbach reso- 
nances***°. To obtain a stable BEC, we tuned the scattering length 
to $a \approx a_{dd}$ using a magnetic field of $B = 6.962(3)$ G in the vicinity of a
Feshbach resonance located at $B_0 = 7.117(3)$ G. We then obtained typ-
ically 15,000 atoms in nearly pure Dy BECs (see Methods section). The
atoms were trapped in a radially symmetric, pancake-shaped trap with 
harmonic frequencies of $(\nu_x, \nu_y, \nu_z) = (46, 44, 133)$ Hz, and the external
magnetic field aligned the magnetic dipoles in the axial z direction. 
Subsequently, we tuned the magnetic field such that $B \approx 6.9$ G, which
reduced $a$ to $a_{bg} < a_{dd}$, resulting in an angular roton instability'® that
triggered the transition to ordered states (Fig. 1a). We then observed 
the formation of droplets that arranged in ordered structures using 
in situ phase-contrast polarization imaging along the z direction with 
a spatial resolution of 1 μm.

Figure 1 | Growth of a microscopic droplet crystal. a, Schematic of the
experimental procedure. We prepared a stable, strongly dipolar Dy BEC
with $a \approx a_{dd}$ in a pancake-shaped trap (left). By decreasing the scattering
length $a$, we induced an instability close to $a \approx a_{bg}$. Following this
instability, the atoms clustered to droplets in a triangular pattern (right).
b, Representative single samples of droplet patterns imaged in situ, with
droplet numbers, $N_d$, ranging from two to ten. c, We used a set of 112
realizations with different numbers of droplets and atoms for a statistical
analysis. The plot shows the mean number of atoms as a function of the
number of droplets $N_d$, with error bars indicating the standard deviation.
The fitted linear relation (grey dashed line) has a slope of 1,750(300) atoms 
per droplet. This shows that increasing the number of atoms results in 
growth of the microscopic droplet crystal. 


In Fig. 1b, we show typical in situ images of the resultant triangular
patterns for the quantum ferrofluid with different numbers of droplets,
$N_d$, ranging from two to ten. To analyse the average number of atoms
per droplet, we count the number of droplets $N_d$ in relation to the total
number of atoms. Figure 1c indicates a linear dependence between $N_d$
and the number of atoms, with a slope of 1,750(300) atoms per droplet.
For $N_d = 2$, we observe a droplet distance of $d = 3.0(4)$ μm. The droplets,
which have a large effective dipole moment of $N\mu$ (with $N$ the number of atoms per droplet), strongly repel each
other while the radial trapping applies a restoring force. Hence, the dis- 
tance d can be calculated using a simplified one-dimensional classical 
system by minimizing the energy of the system. 

We assume two strongly dipolar particles with 1,750 times the mass 
and magnetic moment of a Dy atom that are confined in a harmonic 
trap. For our experimental parameters, these particles minimize their 
energy with a distance of $d = 3.3$ μm, in agreement with the observed
distance. For $N_d > 2$, the droplets arrange mostly in triangular struc-
tures, and form a microscopic crystal with a droplet distance of
$d = 2$–$3$ μm. Owing to the isotropy of the repulsion between droplets
in the radial plane, we expect the triangular configuration to have the 
lowest energy. Because of the repelling dipolar force between the drop- 
lets, we observe in the radial direction nearly round, discrete droplets 
with possible weak overlap to neighbouring ones. 
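
The two-droplet estimate above can be reproduced with a short calculation. This is a sketch under stated assumptions rather than the authors' own code: each droplet is treated as a point dipole of 1,750 atoms, the two dipoles sit side by side at ±d/2 in a one-dimensional harmonic trap, the radial frequency is taken as 45 Hz (the mean of the quoted 46 Hz and 44 Hz), and the ¹⁶⁴Dy mass of 163.93 u and the constants are assumed values. All function names are illustrative.

```python
import math

MU_0 = 4e-7 * math.pi        # vacuum permeability, T m / A (assumed value)
MU_B = 9.2740100783e-24      # Bohr magneton, J / T (assumed value)
AMU = 1.66053906660e-27      # atomic mass unit, kg (assumed value)


def droplet_energy(d, n_atoms, mu, mass, omega):
    """Energy of two point-like droplets at +d/2 and -d/2 in a 1D harmonic trap.

    Trap term: two droplets of mass n_atoms * mass, each displaced by d/2.
    Dipole term: repulsion of two parallel dipoles n_atoms * mu placed
    side by side (dipoles perpendicular to the line joining them).
    """
    trap = 2 * 0.5 * (n_atoms * mass) * omega**2 * (d / 2) ** 2
    dipole = MU_0 * (n_atoms * mu) ** 2 / (4 * math.pi * d**3)
    return trap + dipole


def equilibrium_distance(n_atoms, mu, mass, omega):
    """Setting dE/dd = 0 gives d^5 = 3 mu_0 N mu^2 / (2 pi m omega^2)."""
    return (3 * MU_0 * n_atoms * mu**2 / (2 * math.pi * mass * omega**2)) ** 0.2


omega_r = 2 * math.pi * 45.0  # assumed mean radial trap frequency, rad/s
d_eq = equilibrium_distance(1750, 9.93 * MU_B, 163.93 * AMU, omega_r)
d_eq_um = d_eq * 1e6          # equilibrium distance in micrometres
```

Under these assumptions the minimum lies at a separation of a little over 3 μm, consistent with the value quoted in the text. Because the distance scales only as the fifth root of the atom number, the estimate is not very sensitive to the exact number of atoms per droplet.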

Comparing our quantum ferrofluid with a classical ferrofluid, very 
similar behaviour and patterns have been observed on a superhydro- 
phobic surface’. In this classical-ferrofluid system, a single droplet 
first deforms as the external magnetic field increases, and then divides 
into two droplets when some critical field is reached. For a quantum 
ferrofluid, a single droplet should be unstable for $a < a_{dd}$, owing to
the attractive part of the dipolar interaction, and so should collapse. 
Although the counteracting quantum pressure (the zero-point energy
that exists as a result of an external trapping potential) can compen-
sate attraction and prevent collapse”®, mean field calculations'® predict
this not to be the case. Our observation of stable droplet ensembles
is therefore striking, and further work is needed to understand their 
stability. A possible stabilizing effect is that of quantum fluctuations, 
leading to beyond mean-field effects””. Such stabilization has been sug- 
gested in a similar situation of competing attraction and repulsion”®, 
and an increased effect of quantum fluctuations has been calculated 
for strongly dipolar gases”. 

As a further quantitative statistical analysis, we computed the
Fourier spectrum $S(k)$ of the obtained images (Fig. 2a–c). The pat-
terns are visible as a local maximum in $S(k)$ at finite momentum
$k = 2\pi/d \approx 2.5$ μm⁻¹, whereas the spectrum of a BEC monotonically
decreases with $k$. We define the spectral weight

$$SW = \sum_{k = 1.5\,\mu\mathrm{m}^{-1}}^{5\,\mu\mathrm{m}^{-1}} S(k)$$

which is a quantity that represents the strength of the structured states,
and normalize it such that a BEC has $SW_{BEC} = 1$. After a quench of
the interactions from $a \approx a_{dd}$ to $a \approx a_{bg}$, we statistically investigated the
pattern-formation time and the lifetime of these patterns (Fig. 2d). We 
repeated this measurement 13 times and found statistically that the 
pattern is fully developed after 7 ms, and has a 1/e lifetime of about 
300 ms. The decay of the droplet structure is accompanied by a decrease 
in the number of atoms, with a 1/e lifetime of about 130 ms, while the 
residual thermal cloud is constant. Owing to the decreasing number 
of atoms, the structures evolve back to lower numbers of droplets, $N_d$,
until they merge back into one droplet (insets of Fig. 2d). In compari- 
son, because we measured lifetimes of a non-structured BEC of more 
than 5 s, we assume increased three-body losses as a reason for the 
reduced lifetime. One indication of this is the measured atomic peak
density for droplets of $n > 5 \times 10^{20}$ m⁻³, which is greater than the den-
sity of a BEC, $n = 10^{20}$ m⁻³.
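
The spectral-weight analysis described above (two-dimensional Fourier transform, radial average, sum over 1.5 to 5 μm⁻¹, normalization to an unstructured cloud) can be sketched as follows. This is an illustrative NumPy version, not the authors' analysis code: the bin count, pixel size and synthetic test images are assumptions, the white-noise subtraction mentioned in the Fig. 2 caption is omitted, and the function names are hypothetical.

```python
import numpy as np


def radial_spectrum(image, pixel_um, n_bins=40):
    """Radially averaged |FFT| of a 2D image; k is returned in rad/um."""
    ny, nx = image.shape
    amplitude = np.abs(np.fft.fft2(image))
    kx = 2 * np.pi * np.fft.fftfreq(nx, d=pixel_um)
    ky = 2 * np.pi * np.fft.fftfreq(ny, d=pixel_um)
    k = np.hypot(*np.meshgrid(kx, ky))           # |k| for every FFT pixel
    edges = np.linspace(0.0, k.max(), n_bins + 1)
    sums, _ = np.histogram(k, bins=edges, weights=amplitude)
    counts, _ = np.histogram(k, bins=edges)
    centres = 0.5 * (edges[:-1] + edges[1:])
    return centres, sums / np.maximum(counts, 1)  # mean amplitude per ring


def spectral_weight(k, s, k_min=1.5, k_max=5.0):
    """Sum of S(k) over the momentum window used in the text (rad/um)."""
    mask = (k >= k_min) & (k <= k_max)
    return s[mask].sum()


# Synthetic images for illustration only (not experimental data):
# a smooth Gaussian cloud and the same cloud modulated with period 2.5 um.
pixel = 0.2                                       # assumed pixel size, um
coords = (np.arange(128) - 64) * pixel
x, y = np.meshgrid(coords, coords)
cloud = np.exp(-(x**2 + y**2) / (2 * 3.0**2))
pattern = cloud * (1 + 0.5 * np.cos(2 * np.pi * x / 2.5))

k, s_cloud = radial_spectrum(cloud, pixel)
_, s_pattern = radial_spectrum(pattern, pixel)
sw_relative = spectral_weight(k, s_pattern) / spectral_weight(k, s_cloud)
```

Dividing by the spectral weight of the unstructured cloud mirrors the normalization in which a BEC has a spectral weight of 1, so a modulated density distribution yields a relative value well above 1.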

To explore the nature of this instability, we performed the following 
experimental sequence, depicted in Fig. 3a. We prepared the BEC close 
to the Feshbach resonance with $a \approx a_{dd}$ and ramped the magnetic field
linearly to varying values near the instability point. We ensured that the 
structures were formed within 10 ms, even for values of the magnetic 
field close to the stability threshold, and waited at the chosen value for
twice this time. Figure 3b shows a clear hysteresis for the way down in
magnetic field compared to the return. For the return, we observe the
same spectral weights as for the way down, but for magnetic fields that
are about 20 mG greater. This demonstrates that our system features
bistability in the transition region. We expect that the transition from
one state to the other is driven by thermal excitations or weak cur-
rents due to overlapping droplets. In the thermodynamic limit, such
behaviour is a clear signature of a first-order phase transition and a
latent heat in the crystallization process.

To verify that we are dealing with a transition to a ground state
and not a metastable state resulting from quench dynamics, we per-
formed forced evaporative cooling at a constant magnetic field far away
from any Feshbach resonance with $a \approx a_{bg}$. We observed very similar
self-organized structures, which were visible for temperatures near the
expected critical temperature for the phase transition to a BEC.

Figure 2 | Evaluation of the structures and lifetime analysis.
a–c, Illustration of our statistical evaluation procedure. We began with
the spatial density distribution (a), and then calculated the absolute
value of the two-dimensional Fourier transform $S(k_x, k_y)$ (b) and radially
averaged over $k = (k_x^2 + k_y^2)^{1/2}$ to obtain the spectrum $S(k)$ (c).
We removed the white noise from the spectrum $S(k)$ such that $S(k) = 0$
for $k > 7$ μm⁻¹, which corresponds to structures below our resolution.
The spectra in c represent an average of 13 images for both BECs and
patterns at a wait time of 10 ms. For patterns in the spatial density
distribution, we observe an enhanced signal for $k \approx 2.5$ μm⁻¹ in the
spectrum (red line), whereas the spectrum of BECs (black line) shows
monotonic decay for increasing momentum. We define the sum of these
spectra over a momentum range as the relative spectral weight SW (shaded
areas, as defined in the text), which is a quantity for the strength of the
structured states. d, We performed a sudden quench (over 0.5 ms) of the
BEC below the instability to $B = 6.656(3)$ G, for varying wait times.
To determine the creation time and the lifetime of the patterns, we plot
the relative spectral weight SW (red diamonds) against wait time. Each
point is an average of 13 realizations, with error bars indicating the
standard error. The plot shows rapid pattern formation within 7 ms and a
surprisingly long 1/e lifetime of about 300 ms. This lifetime seems to be
limited by a decrease in the number of atoms (blue circles). The insets
are typical spatial density distributions of single samples before
pattern formation (BEC; bottom left), and at three different wait times.


Figure 3 | Hysteresis of pattern formation. a, A timeline of the experiment 
in which we observed hysteresis. We prepared the Dy BEC close to the 
Feshbach resonance at B = 6.962(3) G and ramped down the magnetic field 
linearly to different values, with a constant change rate, the lowest of which 
was B= 6.860(3) G. To ensure that the structures had enough time to form, 
we waited for 20 ms at each value before imaging the atomic sample in situ. 
For the way back, we first waited at the lowest field value for 20 ms and then 
increased the magnetic field with the same ramp speed to higher values,
once again holding for 20 ms at each value before imaging the sample in situ.
b, Hysteresis plot for the structured patterns, which shows the spectral weight 
SW against magnetic field as it is decreased (red diamonds and line) and then 
increased (green squares and line). Each point is an average of 14 realizations, 
with the vertical error bars indicating standard errors. We determined the 
long-term field stability to be 3 mG, as shown by the horizontal error bars
for selected points. A clear hysteresis is visible, although the total time is
twice as long for the way back. The labels 1–6 in a and b indicate data points
at particular field values to aid the reader; the lines in b serve as
a guide to the eye.
Because structures can melt back into a BEC, and because we
observed evaporative cooling to patterns, it is quite plausible that the 
droplets are superfluid individually. Whether they share the same 
phase via weak links or lose their mutual phase coherence will have 
to be investigated in future experiments. Another open question is the 
creation dynamics of a self-organized structure. It will be interesting
to study phonons in such a droplet crystal, which we expect to have 
eigenfrequencies of the order of many inverse lifetimes. We also expect 
that these phonons will be coupled to collective Josephson oscillations 
via weak links*®. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Preparation of a Dy BEC. Bosonic ¹⁶⁴Dy atoms are first cooled in a narrow-line
magneto-optical trap, operating on the 626-nm optical cycling transition, which
polarizes the atoms to the lowest Zeeman state $J = 8$, $m_J = -8$, and subsequently
directly loaded into a single-beam optical dipole trap (ODT) created by a 
broadband fibre laser operating at 1,070 nm with a power of 72 W. By moving 
the focusing lens of the transport ODT over a range of 375 mm, we transport the 
atoms into a glass cell. After the transport, we load the atoms from the transport 
ODT into a crossed optical dipole trap created by a single-mode laser operating 
at 1,064 nm. Following this, the atoms are Doppler cooled with 626-nm light, and
we utilize forced evaporative cooling by ramping down the power of both trapping 
laser beams at a magnetic field of B= 1.012 G far away from any resonance. When 
close to degeneracy, we tune the magnetic field close to a Feshbach resonance and 
achieve there a quasi-pure BEC with typically 15,000 atoms and a temperature 
of $T = 70$ nK. Before shaping the final trap with harmonic frequencies of
$(\nu_x, \nu_y, \nu_z) = (46, 44, 133)$ Hz, we apply an additional magnetic gradient of 1.1 G cm⁻¹ to
partially compensate gravity. 

Feshbach resonance. The background scattering length of ¹⁶⁴Dy has been meas-
ured to be $a_{bg} = 92(8)a_0$, in which $a_0$ is the Bohr radius™*”. In the vicinity of a
Feshbach resonance, the scattering length scales as $a = a_{bg}(1 - \Delta B/(B - B_0))$, with
$B$ the applied magnetic field, $B_0$ the centre and $\Delta B$ the width of the Feshbach
resonance. To tune the scattering length $a$, we used a narrow resonance at
$B_0 = 7.117(3)$ G with $\Delta B = 51(15)$ mG. We calibrated the magnetic field with a
weak radiofrequency field driving transitions between magnetic sublevels. To
measure the resonance position, we use atom trap-loss spectroscopy and find the
maximal loss at the central position $B_0$ of the Feshbach resonance. To estimate the
width of the resonances we did a thermalization experiment at different magnetic
field positions and found a maximum temperature at the field position $B_0 + \Delta B$
(Extended Data Fig. 1b). Close to the maximal temperature we observed another
narrow resonance that influences our width measurement. Hence, we cannot
state precise values for the scattering length. In addition, we did the same inves-
tigations for the structured states on a second, even narrower resonance located
at $B_0 = 1.326(3)$ G with a width of $\Delta B = 8(5)$ mG (Extended Data Fig. 1a). We
observed the same qualitative results with this resonance, which indicates that 
self-organization is independent of a particular Feshbach resonance. 
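
As a rough illustration of the scaling formula above, the sketch below evaluates the nominal scattering length at two fields mentioned in the paper. It uses only the central values of the quoted parameters and ignores their uncertainties and the neighbouring narrow resonance, so the numbers are indicative at best (the text itself cautions that precise values cannot be stated). The function name is hypothetical.

```python
def scattering_length(b_field, a_bg=92.0, b_0=7.117, delta_b=0.051):
    """Nominal a(B) = a_bg * (1 - delta_b / (B - b_0)).

    Fields are in gauss; the result is in units of the Bohr radius.
    Defaults are the central values quoted in the text.
    """
    return a_bg * (1.0 - delta_b / (b_field - b_0))


a_prepare = scattering_length(6.962)  # field used to prepare the stable BEC
a_quench = scattering_length(6.656)   # field after the quench of Fig. 2d
```

Below the resonance centre the correction term is positive, so moving the field further from the resonance lowers the nominal scattering length towards the background value, which is the direction of tuning used to trigger the instability.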

In situ phase-contrast polarization imaging. Phase-contrast polarization imag- 
ing was first introduced with lithium atoms” and relies on the dispersive phase 
shift instead of direct absorption giving rise to the optical density. We use an 
off-resonant beam ($\Delta = -1.1$ GHz $= -35\Gamma$) near the strongest optical transition
at 421 nm with a linewidth of $\Gamma \approx 32$ MHz to suppress absorption. The imaging
beam is linearly polarized and we apply a magnetic field of more than 1 G in the
beam propagation direction so that the atoms see a mixture of left- and right-
handed circularly polarized light. In our case, the Dy atoms are fully spin-polarized
in the lowest-lying Zeeman state $m_J = -8$. Thus, the atoms couple mainly to the
$\sigma^-$ optical transition, and the $\sigma^+$ transition is suppressed by a factor of about 150,
owing to the difference in the Clebsch-Gordan coefficients. Hence, the atoms show 
a strong circular birefringence or optical rotation. If both coupled and uncoupled 
polarizations are combined on a linear polarizer, then the transferred intensity 
distribution depends on the dispersive shift of the atoms. The atomic plane is then 
imaged through a commercial diffraction-limited objective (numerical aperture of 
0.32), corrected for the upper window of our glass cell, with a resolution of 1 μm.
With a second commercial objective, the image is magnified by a factor of 50 and 
guided through a polarizer to the camera. 

Sample size. No statistical methods were used to predetermine sample size. 
maximized at $B_0 + \Delta B$, with $\Delta B = 8(5)$ mG. We prepared stable BECs
Observation of polar vortices in oxide superlattices


A. K. Yadav, C. T. Nelson, S. L. Hsu, Z. Hong, J. D. Clarkson, C. M. Schlepütz, A. R. Damodaran, P. Shafer,
E. Arenholz, L. R. Dedon, D. Chen, A. Vishwanath, A. M. Minor, L. Q. Chen, J. F. Scott, L. W. Martin & R. Ramesh


The complex interplay of spin, charge, orbital and lattice degrees 
of freedom provides a plethora of exotic phases and physical 
phenomena! ©. In recent years, complex spin topologies have 
emerged as a consequence of the electronic band structure and 
the interplay between spin and spin-orbit coupling in materials®’. 
Here we produce complex topologies of electrical polarization— 
namely, nanometre-scale vortex—antivortex (that is, clockwise- 
anticlockwise) arrays that are reminiscent of rotational spin 
topologies°—by making use of the competition between charge, 
orbital and lattice degrees of freedom in superlattices of alternating 
lead titanate and strontium titanate layers. Atomic-scale mapping 
of the polar atomic displacements by scanning transmission 
electron microscopy reveals the presence of long-range ordered 
vortex-antivortex arrays that exhibit nearly continuous 
polarization rotation. Phase-field modelling confirms that the 
vortex array is the low-energy state for a range of superlattice 
periods. Within this range, the large gradient energy from the 
vortex structure is counterbalanced by the corresponding large 
reduction in overall electrostatic energy (which would otherwise 
arise from polar discontinuities at the lead titanate/strontium 
titanate interfaces) and the elastic energy associated with epitaxial 
constraints and domain formation. These observations have 
implications for the creation of new states of matter (such as 
dipolar skyrmions, hedgehog states) and associated phenomena 
in ferroic materials, such as electrically controllable chirality. 
The ability to synthesize heteroepitaxial complex oxides has ena- 
bled unprecedented access to high-quality, single-crystalline materi- 
als and routes to manipulate order parameters®”. In ferroelectric 
materials (that is, those possessing a spontaneous, switchable electrical 
polarization), epitaxial strain and advances in layer-by-layer growth 
techniques have enabled the study of the fundamental limits of ferro- 
electricity!'' and the discovery of novel interfacial phenomena’. In 
short-period (of only a few unit cells) superlattices of PbTiO3/SrTiO3, 
researchers observed the emergence of ‘improper’ ferroelectricity aris-
ing from octahedral tilts in the SrTiO3 layer'*"'® and this has motivated
a number of additional studies of such effects'’. At relatively large 
length scales (greater than 5 nm), interfaces still play a formative role, 
driving both bulk’® and thin-film’?! ferroelectrics to form patterns 
of flux-closure polar domains, whose microstructures have been the 
topic of extensive research including atomic-scale polarization stud- 
ies!???3. At intermediate length scales (tens of unit cells), theoretical 
studies have suggested that there is potential for topological structures 
such as vortices, waves and skyrmions**, depending on the interplay 
between strain, depolarization and gradient energies. In this work, 
we demonstrate ordered arrays of polar vortices, made up of vortex- 
antivortex pairs, in ferroelectric/paraelectric superlattices. 
Symmetric (SrTiO3)n/(PbTiO3)n superlattices with n = 2–27 were
synthesized on DyScO3 (001)pc (where pc refers to the pseudocubic


notation) substrates via reflection high-energy electron diffraction 
(RHEED)-assisted pulsed-laser deposition (details of the growth are 
provided in Methods and Extended Data Fig. 1). Superlattices are 
henceforth referred to using the ‘n × n’ shorthand wherein n corre-
sponds to the thickness of the SrTiO3 and PbTiO3 layers in unit cells
(structures depicted schematically on the right of Fig. 1a). Chemical
analysis via Rutherford backscattering spectrometry (RBS) confirms
stoichiometric PbTiO3 within the detectability limits of the technique
(±1%, Extended Data Fig. 2). A typical low-magnification, scanning
transmission electron microscopy (STEM, details in Methods) image
of the cross-section of a 10 × 10 superlattice taken along the [010]pc
zone axis reveals the layer uniformity (Fig. 1a), and atomic-scale
high-resolution STEM (HR-STEM) confirms sharp and coherent 
interfaces (Extended Data Fig. 3a). X-ray diffraction studies including 
symmetric Bragg scans (Fig. 1b) and reciprocal space maps (RSMs, 
Fig. 1c) reveal superlattice reflections in the out-of-plane direction (L,
[001]pc) corresponding to the superlattice period of 9–10 nm. Side-
lobe diffraction peaks are also observed along the in-plane direction
(K, [010]pc) corresponding to a 9–10 nm periodicity (arising from a
polar ordering which is discussed in the following section).
Mapping of the atomic polar displacement ($P_{PD}$) was performed
to determine the polarization distribution within the superlattices. 
A displacement vector-mapping algorithm” (details in Methods and 
Extended Data Figs 3 and 4) was implemented on the cross-sectional 
HR-STEM images to measure local non-centrosymmetry of the lattice. 
A vector map of these polar displacements within a 10 x 10 superlattice 
(Fig. 2a) shows the formation of long-range, ordered arrays of vortex 
structures with alternating rotation directions. The lateral perio-
dicity is approximately the same as the superlattice period (~10 nm),
closely matching the in-plane (K, [010]pc) periodicity observed in
the RSMs. These vortices exhibit a continuous rotation of the local
polarization vector, in contrast to segregation into uniform domains
and domain walls typical of ferroelectric domain and flux-closure
structures¹⁹⁻²¹. As a consequence, a dominant fraction of the PbTiO3
layers exhibit a continual rotation of $P_{PD}$, as illustrated in maps of the
curl (that is, vorticity) of the displacement vector field $(\nabla \times P_{PD})_{[010]}$
(Fig. 2c; additional details are provided in Methods and the full curl
map corresponding to Fig. 2a is found in Extended Data Fig. 5a).
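
To make the curl map concrete, the sketch below computes the out-of-plane ([010]) component of the curl of a two-dimensional vector field by finite differences. It is only an illustration of the quantity being mapped: the paper's actual displacement-mapping procedure is described in its Methods, and the function name, array layout, sign convention and toy rotation field here are all assumptions.

```python
import numpy as np


def curl_out_of_plane(p_x, p_z, dx=1.0, dz=1.0):
    """(curl P)_y = dP_x/dz - dP_z/dx on a regular x-z grid.

    Arrays are indexed [iz, ix]; dx and dz are the grid spacings
    (for example, one unit cell).
    """
    dpx_dz = np.gradient(p_x, dz, axis=0)
    dpz_dx = np.gradient(p_z, dx, axis=1)
    return dpx_dz - dpz_dx


# Toy fields: rigid rotations about the grid centre (hypothetical, not data).
x, z = np.meshgrid(np.arange(9) - 4.0, np.arange(7) - 3.0)
vortex = curl_out_of_plane(z, -x)       # P_x = z, P_z = -x
antivortex = curl_out_of_plane(-z, x)   # opposite rotation sense
```

Neighbouring vortices with opposite rotation sense give curl values of opposite sign, which is why a curl map shows the alternating vortex and antivortex cores as alternating contrast.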
To better understand the energetics of the vortex—antivortex states, 
we carried out phase-field modelling of the polarization structure 
as a function of superlattice periodicity (details in Methods). The 
phase-field-calculated polarization maps for the same 10 x 10 super- 
lattice indicate the formation of a vortex—antivortex ground state 
bearing close resemblance to the experimental observations (Fig. 2d). 
Both in- and out-of-plane long-range vortex ordering is observed 
by electron diffraction and dark-field transmission electron micros- 
copy (DF-TEM) along cross-section and planar views (Fig. 3a and b, 
respectively). The cross-sectional DF-TEM images show a long-range 
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Figure 1 | Structural characterization of 
(SrTiO;),,/(PbTiO3),, superlattices. a, Low- 
magnification, STEM image of the cross-section 
of a (SrTiO3)10/(PbTiO3) 10 superlattice, together 
with a schematic illustration of the superlattice 
structure (right). The PbTiO3 (top) is illustrated 
for the case of a [001]. (upward) polarization 
which results from the downward displacement 
of the Ti-‘centred’ oxygen octahedra relative to 
Pb (red, blue, green and gold spheres indicate 


Pb, Ti, Sr and O atoms, respectively; blue- and 
green-shaded regions surrounding Ti atoms 
show oxygen octahedra). b, High-resolution 
θ-2θ symmetric scan of a 100-nm (SrTiO3)10/(PbTiO3)10 
superlattice showing the presence 
of Pendellösung fringes, which attest to the 
smoothness of the surface and the high quality of 
the 100-nm-thick superlattices (see Methods for 
details and nomenclature used for peak labels). 
c, Synchrotron-based, in-plane X-ray RSM study 
of a (SrTiO3)12/(PbTiO3)12 superlattice about 
the DyScO3 011pc diffraction condition. The 
asymmetric L-K scan reveals satellite peaks 
corresponding to vortex—antivortex pairs with 


periodic array of intensity modulation which corresponds to the vortex- 
antivortex pairs with a spacing of 9-10 nm both in and out of the 
plane of the superlattice. Like the X-ray RSM studies, the long-range 
order also manifests as additional superlattice reflections in the selected- 
area electron diffraction (SAED) patterns in both cross-section and 
planar views (see the magnified 001,, and 050,, reflections from the 
SAED patterns in Fig. 3a inset and Fig. 3b inset, respectively). The 
uniform array of stripes in the planar-view DF-TEM (Fig. 3b), again 
with a 9-10 nm periodicity, suggests that the vortices extend over 
distances of at least 1 μm. We also note that vortices are observed to 
exist in both in-plane directions (Extended Data Fig. 6). 

Phase-field modelling of the PbTiO3/SrTiO3 superlattices was performed 
to study the energetics and to ascertain the 3D structure of 
the vortex—antivortex pairs. A range of symmetric superlattice perio- 
dicities were simulated, allowing each to evolve from a random initial 
state according to the time-dependent Ginzburg-Landau equation 
(details in Methods). The 3D structure of the 10 x 10 superlattice, 
corresponding to our experimental HR-STEM studies, is shown in 
Fig. 4. With no a priori assumption of the polarization (P) distribu- 
tion, the model converges to a vortex—antivortex array structure. For 


~10 nm periodicity. 


comparison, sub-regions of the HR-STEM and planar-view DF-TEM 
images are projected on the same 3D axes as the model (Fig. 4). The 
tubular, ordered vortices match the experimentally observed struc- 
tures closely. The polarization rotation extracted from the model is 
mapped onto the front plane and in 3D isosurfaces (Fig. 4) using 
a blue/red colour scale corresponding to alternating clockwise 
(vortices) and anticlockwise (antivortices) according to the 
(∇ × P)[010] curl of the polarization. Two additional slices through 
the 3D structure show the component of polarization along [100]pc 
(P[100]) and [001]pc (P[001]). Like the polar displacements from the 
HR-STEM, the model lacks discrete domains and, instead, exhib- 
its a smooth rotation of the polarization. The long-range order is 
also in good agreement, preferring interlayer alignment of the vor- 
tices, particularly with matching rotation directions (Extended Data 
Fig. 5c). 

On the basis of both the experimental results and the phase-field sim- 
ulations, the observed vortex structure results (primarily) from compe- 
tition between three energies. The first is the elastic energy, which arises 
from the fact that the PbTiO3 is epitaxially constrained (in a tensile 
fashion) by the DyScO3 substrate, which traditionally drives the PbTiO3 


Figure 2 | Observation of vortex-antivortex 
structures. a, Cross-sectional HR-STEM image 
with an overlay of the polar displacement 
vectors (P_PD, indicated by yellow arrows) for a 
(SrTiO3)10/(PbTiO3)10 superlattice, showing that 
an array of vortex-antivortex pairs is present 
in each PbTiO3 layer. b, A magnified image of 

a single vortex—antivortex pair, showing the 

full density of data points (one for each atom) 
and the continuous rotation of the polarization 
state within such vortex—antivortex pairs. c, The 
curl of the polar displacement (∇ × P_PD)[010] 
for the same vortex-antivortex pair reveals the 
alternating rotation directions of the structures 
(see details in Methods and Extended Data Fig. 
5a and b). The (∇ × P_PD)[010] curl value is plotted 
with a red/blue colour scale where no-vorticity 
(curl = 0) is white, clockwise (negative) is blue 
and anticlockwise (positive) is red. d, Polarization 
vectors from a phase-field simulation of the 
same (SrTiO3)10/(PbTiO3)10 superlattice, which 
predicts vortex—antivortex pairs that closely 
match the experimental observations. 
to adopt a mixture of in-plane P[100]/[010] and out-of-plane P[001] polarizations 
(that is, a c/a domain structure). The second major contribution 
is electrostatic energy from built-in electric fields. As one transitions 
from the PbTiO3 to the SrTiO3 layer there is likely to be a large polar 
discontinuity normal to the interface and thus there is a strong driving 
force for bound charges since ∇ · P ≠ 0. Finally, the third major energy 
in competition is the gradient energy. This is the energy required to 
rotate or change the direction or magnitude of the polarization. All told, 




Figure 4 | Three-dimensional phase-field simulation for a (SrTiO3)10/(PbTiO3)10 
superlattice. The 3D geometry of the vortex-antivortex 
array from phase-field simulation of a (SrTiO3)10/(PbTiO3)10 superlattice 
is shown on the right. The front cross-section of the model shows the 
polarization vector P ordered into clockwise (blue) and anticlockwise (red) 
vortex-antivortex states which extend along the [010]pc. For comparison, 
on the left is shown a cross-sectional HR-STEM image overlaid with 
Figure 3 | Long-range ordering of vortices. a, Main panel, cross-sectional 
DF-TEM images of a (SrTiO3)10/(PbTiO3)10 superlattice revealing 
long-range ordering of vortices (each bright/dark modulation corresponds 
to a period of the vortex-antivortex structure). Inset, SAED pattern for the 
sample with the g-vector of the two-beam imaging condition indicated by 
the yellow arrow. Close inspection of the SAED pattern reveals superlattice 
reflections both in and out of the plane, suggesting long-range ordering of 
the vortex structures (boxed area of inset with 001pc reflection is shown 
magnified at right). b, Planar-view DF-TEM imaging of a (SrTiO3)16/(PbTiO3)16 
superlattice which likewise exhibits long-range in-plane 
ordering associated with the vortex-antivortex pairs along [100]pc and 
confirms the widespread occurrence of these vortex pairs in the 
superlattice films. The upper inset shows the SAED pattern for the sample 
with the g-vector of the two-beam imaging condition indicated by the 
yellow arrow. Again, close inspection of the diffraction pattern (boxed area 
of inset shown magnified) reveals in-plane superlattice reflections 
indicating long-range ordering of the vortex structures. The lower inset 
shows a magnified view of the vortex structures, again revealing ~10 nm 
periodicities. 


in this system, these three energies combine to create a new topology 
of polarization: the system localizes the in-plane P[100]/[010] 
components at the PbTiO3/SrTiO3 interfaces and forms alternating 
out-of-plane P[001] components mid-layer, thereby creating a rotating 
polarization. 

It should also be noted that the balance of these terms, and thus 
the ground-state structure, is strongly dependent on the superlattice 
period. In short-period superlattices (n < 10), the vortex becomes 
unstable and, instead, the PbTiO3 minimizes the electrostatic 
and gradient energy at the expense of elastic energy by adopting only 
in-plane P[100]/[010] components (that is, a1/a2 domain structures). 
As the superlattice period increases to intermediate length scales, 
a discontinuity in the total energy appears in the phase diagram 
(derived from the phase-field simulations and total energy calculations) 
and thus a co-existence of the a1/a2 and vortex-antivortex 
structures is observed. In long-period superlattices (n = 18), the sys- 
tem begins to evolve into classical flux-closure domain structures 
characterized by the formation of discrete domains surrounding 
localized polarization gradients at domain walls. A side-by-side 
comparison of HR-STEM polar displacement maps and phase-field 




a polar displacement vector map and a planar-view DF-TEM image 
projected onto the front and top planes of the axes, respectively. Red/blue 
colour scales correspond to the curl of the polarization extracted from 
the phase-field model and the HR-STEM polar displacement map (P_PD). 
The scale bars on P_x and P_z are from −44.81 to 44.81 (μC cm⁻²) and from 
−54.24 to 54.24 (μC cm⁻²), respectively, whereas the vorticity is from 
−129.36 to 129.36 (μC cm⁻² nm⁻¹). 
simulations for vortex (10 x 10) and flux-closure (50 x 50) structures 
is provided (Extended Data Fig. 7); we also show the superlattice- 
period-dependent evolution of structures from phase-field mod- 
els (Extended Data Fig. 8). In summary, the continuous curling of 
polarization allows the simultaneous achievement of electrostatically 
favoured, non-diverging polarization and a strain-favoured mixture 
of in-plane and out-of-plane components of polarization, stabilizing 
novel vortex states. 

These observations beget several important questions and avenues 
for future exploration. First, the vortices are observed to be slightly 
asymmetric (the cores not being perfectly centred and equidistant, 
Fig. 2a), which could be a source of chirality and potentially used to 
produce tunable optical behaviour. Second, although the long-range, 
in-plane order of the vortex structures is quite robust, the out-of- 
plane order is somewhat less so. This out-of-plane ordering is likely 
to be a low-energy excitation (akin to ferroelectric liquid crystals and 
Abrikosov lattices in superconductors) and consequently to exhibit a 
certain degree of disorder and the potential for multiple ground states, 
which may be a rich area for exploration of the macro-scale ordering by 
tuning the interlayer stacking. Last, the generation of vortex topologies 
could be extended to other material systems and other symmetries—for 
example, a multiferroic BiFeO3 system could also provide the possibility 
of enhancing the canted moment arising from Dzyaloshinskii-Moriya 
interactions. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Material system. The materials used in this study are lead titanate (PbTiO3), strontium 
titanate (SrTiO3), strontium ruthenate (SrRuO3) and dysprosium scandate 
(DyScO3). PbTiO3 has a tetragonal structure with space group P4mm and bulk 
lattice parameters a = 3.904 Å and c = 4.178 Å. PbTiO3 is a ferroelectric at room 
temperature with an axis of spontaneous polarization along [001] and undergoes 
a phase transition from a tetragonal (ferroelectric) to a cubic (paraelectric) structure 
at 490 °C in the bulk. SrTiO3, on the other hand, is a band insulator at room 
temperature with a cubic structure having a bulk lattice parameter of a = 3.905 Å 
and space group Pm3̄m. SrRuO3 is a ferromagnetic metal at room temperature 
with an orthorhombic structure having space group Pbnm and bulk lattice parameters 
of a = 5.567 Å, b = 5.530 Å and c = 7.845 Å. DyScO3 is a band insulator at 
room temperature and has an orthorhombic structure with space group Pbnm and 
bulk lattice parameters of a = 5.440 Å, b = 5.717 Å and c = 7.903 Å. The orientation 
relationships between the pseudocubic indices used in the text and the orthorhombic 
indices of the DyScO3 substrate are (001)pc || (110)o, (100)pc || (1̄10)o, (010)pc || (001)o. 
Synthesis of the superlattices using RHEED-assisted pulsed-laser deposition. 
Superlattices of (SrTiO3)n/(PbTiO3)n were synthesized on 5 nm SrRuO3-buffered, 
single crystalline DyScO3 (001)pc substrates via pulsed-laser deposition. The 
growth temperature and oxygen pressure for the growth of the superlattices were 
630 °C and 100 mtorr, respectively, to ensure stoichiometric transfer of both PbTiO3 
and SrTiO3. Growth of the SrRuO3 was accomplished at a growth temperature of 
700 °C and an oxygen pressure of 50 mtorr. All depositions were performed at a 
laser fluence of 1.5 J cm⁻². The growth was monitored using reflection high-energy 
electron diffraction (RHEED) to ensure maintenance of a layer-by-layer growth 
mode throughout the growth process (Extended Data Fig. 1) which, in turn, enables 
controlled growth of the superlattices with different period-thicknesses while 
achieving the same total thickness. The RHEED oscillations were monitored for 
specular spots along [100]pc of the DyScO3 (001)pc substrate. After the growth, the 
samples were cooled to room temperature in 50 torr of oxygen to promote full oxidation. 
The use of a Pb1.2TiO3 target was found to be critical in enabling persistent 
layer-by-layer growth (evidenced by RHEED oscillations obtained throughout the 
entire growth process, Extended Data Fig. 1), whereas the use of a stoichiometric 
PbTiO3 target resulted in rapid transition to 3D (island) growth. 
Structural characterization. Laboratory-based X-ray diffraction. Superlattices were 
characterized with a Panalytical X’Pert Pro X-ray Diffraction (XRD) machine with 
a Cu source. Symmetric XRD scans (Fig. 1b) were extensively performed to fully 
characterize the structural order in the out-of-plane direction, [001]pc, in these 
superlattice structures. The superlattice period achieved via RHEED-controlled 
growth was later confirmed using high-resolution θ-2θ symmetric XRD scans 
(see, for example, Fig. 1b). Thickness fringes (Pendellösung fringes) 
around both the central Bragg peak (SL(0) 002) and the superlattice peaks (SL(−1) 
002 and SL(+1) 002) are clearly visible in 10 x 10 superlattices (Fig. 1b) and attest 
to the high crystallinity of the films, the precise nature of the interfaces, and the 
overall smoothness of these structures. 
Synchrotron X-ray diffraction. This was employed to study the structure (both 
in and out of the plane) of the superlattices. The diffraction studies were per- 
formed at the Sector 33-BM-C beamline of the Advanced Photon Source, Argonne 
National Laboratory. The high flux from a synchrotron source delivered at this 
beamline was optimal for extracting lattice modulations associated with these 
vortex-antivortex pairs. In order to obtain a highly monochromatic beam with 
negligible higher-order harmonics, a double crystal monochromator in conjunc- 
tion with two mirrors was used. Moreover, excellent accuracy of a Huber 4-circle 
diffractometer in combination with a PILATUS 100K pixel detector allowed us 
to determine the orientation of the crystal reliably and obtain 3D RSMs with 
high accuracy. The X-ray wavelength used for obtaining the RSMs was 0.8 Å 
(E = 15.5 keV). 
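The quoted photon energy follows from the wavelength through E = hc/λ. A minimal check of that arithmetic, assuming the rounded constant hc ≈ 12.398 keV·Å (the function name is illustrative and not from the paper):

```python
# Planck constant times the speed of light, expressed in keV·Å (rounded).
HC_KEV_ANGSTROM = 12.398419843


def photon_energy_kev(wavelength_angstrom: float) -> float:
    """Convert an X-ray wavelength in ångströms to a photon energy in keV."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom


# Wavelength quoted in the text for the reciprocal-space maps.
energy = photon_energy_kev(0.8)
print(f"{energy:.2f} keV")  # prints "15.50 keV"
```

This agrees with the 15.5 keV stated for the 0.8 Å beam.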
Chemical analysis by RBS. This was employed to confirm the stoichiometry of 
the PbTiO3 at the optimized growth conditions and to eliminate any spurious 
effects of non-stoichiometry on the observation of these vortex structures. The 
results for a 200 nm PbTiO3 film are provided (Extended Data Fig. 2). These results 
and the corresponding fit to the data confirm the chemical composition and homo- 
geneity of the films within the limits of experimental errors. The spectra were taken 
with a 3.04 MeV He ion beam incident at an angle of 22.5° relative to the sample 
normal. The backscattered He ions were collected by a silicon surface-barrier 
detector positioned at 168° with respect to the incident beam. The SIMNRA soft- 
ware package was used to simulate the RBS spectra and obtain the composition of 
the film. R² analysis was performed about the Pb and Ti peaks to avoid artificially 
increasing the value of R² with the inclusion of substrate peaks. R² was calculated 
using 

$$R^2 = 1 - \frac{\sum_i (e_i - f_i)^2}{\sum_i (e_i - \bar{e})^2}$$

where e_i and f_i values correspond to experimental and 
simulated data points, respectively, for different channel energies (indicated by 
variable i), and ē indicates the mean of experimental data points. 
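As an illustration of this goodness-of-fit measure, the sketch below assumes the standard coefficient of determination, R² = 1 − Σ(e_i − f_i)² / Σ(e_i − ē)², evaluated over the channels around the Pb and Ti peaks. The function name and sample arrays are hypothetical and are not RBS data from the paper.

```python
import numpy as np


def r_squared(experimental, simulated):
    """Coefficient of determination between measured and simulated counts."""
    e = np.asarray(experimental, dtype=float)
    f = np.asarray(simulated, dtype=float)
    if e.shape != f.shape:
        raise ValueError("experimental and simulated arrays must match in shape")
    ss_res = np.sum((e - f) ** 2)         # residual sum of squares
    ss_tot = np.sum((e - e.mean()) ** 2)  # total sum of squares about the mean
    return 1.0 - ss_res / ss_tot


# Hypothetical counts for a handful of channels (illustration only).
measured = [10.0, 40.0, 90.0, 40.0, 10.0]
fitted = [12.0, 38.0, 88.0, 42.0, 11.0]
print(r_squared(measured, fitted))
```

Restricting the inputs to the peak channels, as the text describes, keeps the substrate signal from inflating the total sum of squares and therefore R².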


Conventional and scanning transmission electron microscopy. TEM samples 
were prepared by mechanical polishing on an Allied High Tech Multiprep. Cross- 
sectional samples were polished at a 0.5° wedge, and planar samples with a flat 
polish. Samples were subsequently Ar ion milled using a Gatan Precision Ion 
Milling System with starting energies of 4keV stepped down to a final cleaning 
energy of 200eV. 

Diffraction contrast TEM (Fig. 3) was performed on a JEOL 3010 at 300 kV. 
Dark field imaging was performed in the two-beam condition, the diffracted beam 
travelling along the optical axis of the TEM. The samples were tilted off zone-axis 
to isolate selected crystallographic planes used for imaging: g = 002pc for Fig. 3a 
and Extended Data Fig. 6a; g = 020pc for Figs 3b and 4; and g = 002pc for Extended 
Data Fig. 6b. The tilt condition used for Fig. 3a and Extended Data Fig. 6b is shown 
by the SAED pattern (inset) in the latter. SAED patterns inset in Fig. 3a and b were 
collected along the zone axis. 

Scanning TEM was performed on a Cs-corrected TEAM 0.5 FEI Titan microscope 
at 300 kV using a high-angle annular detector resulting in ‘Z-contrast’ 
images. Raster distortion was minimized using a pair of HR-STEM images of the 
same region taken with orthogonal scan orientations. These were used to calculate 
a corrected ‘pre-distortion’ image to minimize the slow scan axis errors. 

Determination and mapping of the polar atomic displacements (P_PD) was performed 
on the corrected HR-STEM HAADF images (Extended Data Fig. 3a, b) 
using local A- and B-site sublattice offset measurements. The rationale is that for 
many displacive perovskite ferroelectrics the electric dipole moment is manifested 
by relative shifts of the cations, as is the case for PbTiO3, and this offset can be 
used to infer the full polarization (P). This is very helpful since the anions (oxygen) 
needed to fully characterize an electric dipole have relatively weak electron 
scattering resulting in low signal-to-noise ratio, especially for HAADF images. In 
room-temperature bulk PbTiO3, the Ti B-site is offset by 16.2 pm and the oxygen 
by 47-49 pm relative to the Pb lattice in the direction opposite to the polarization. 
Thus, we use the Ti to infer the offset of the Ti-‘centred’ oxygen octahedra for 
defining the electric dipole. Displacement vectors (P_PD) corresponding to local 
offsets between the A- and B-site sublattices were calculated by first determining 
atomic positions by fitting each atom site in the image to four-parameter, spherical 
Gaussians using a trust-region algorithm in Matlab (see illustration in Extended 
Data Fig. 3c). Fits were performed simultaneously for five-atom clusters of the 
B-site-centred perovskite unit cell (Extended Data Fig. 3d). The displacement for 
each atom was then calculated as the difference between its atom position and 
the mean position (that is, centre of mass) of the surrounding four opposite-type 
cations (Extended Data Fig. 4c, d). Using a pseudocubic grid indexing (A-sites 
defining a grid) and defining the cross-section image to lie in the x-z Cartesian 
plane (x = [100]pc, z = [001]pc), for an atom at grid position i,j this corresponds to 
mean neighbour positions (MNP) of: 


$$\mathrm{MNP}x_{i,j} = (x_{i-0.5,j-0.5} + x_{i-0.5,j+0.5} + x_{i+0.5,j+0.5} + x_{i+0.5,j-0.5})/4$$


$$\mathrm{MNP}z_{i,j} = (z_{i-0.5,j-0.5} + z_{i-0.5,j+0.5} + z_{i+0.5,j+0.5} + z_{i+0.5,j-0.5})/4$$


Illustrative examples for both A-site and B-site regions are shown in Extended Data 
Fig. 4. To maintain a consistent displacement vector sign, we reverse the order for 
A-sites and B-sites such that: 


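$$\mathbf{P}_{\mathrm{PD},i,j}^{\text{A-site}} = (x_{i,j} - \mathrm{MNP}x_{i,j})\,\hat{\mathbf{x}} + (z_{i,j} - \mathrm{MNP}z_{i,j})\,\hat{\mathbf{z}}$$


$$\mathbf{P}_{\mathrm{PD},i,j}^{\text{B-site}} = (\mathrm{MNP}x_{i,j} - x_{i,j})\,\hat{\mathbf{x}} + (\mathrm{MNP}z_{i,j} - z_{i,j})\,\hat{\mathbf{z}}$$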


(as illustrated in Extended Data Fig. 4d). This convention defines a displacement 
vector direction nominally parallel to the full electrical polarization of the bulk crystal 
structure. Noise in the displacement vector fields was reduced by weighted smooth- 
ing of the first derivative over a local range of 12 atoms (calculation details below). 
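The mean-neighbour-position construction can be sketched in a few lines of NumPy. This is an illustrative reading of the procedure, not the authors' Matlab code: it assumes the fitted atom positions have already been sorted onto two interpenetrating grids, with `a_sites` of shape (ni, nj, 2) and `b_sites` of shape (ni−1, nj−1, 2) holding (x, z) coordinates. All names are hypothetical.

```python
import numpy as np


def mean_neighbour_positions(other):
    """Mean (x, z) of the four opposite-type neighbours at (i±0.5, j±0.5)."""
    return (other[:-1, :-1] + other[:-1, 1:] + other[1:, 1:] + other[1:, :-1]) / 4.0


def polar_displacements(a_sites, b_sites):
    """Return (P_PD on interior A-sites, P_PD on B-sites) as (x, z) vectors."""
    a = np.asarray(a_sites, dtype=float)
    b = np.asarray(b_sites, dtype=float)
    if b.shape != (a.shape[0] - 1, a.shape[1] - 1, 2):
        raise ValueError("b_sites must have shape (ni-1, nj-1, 2)")
    # B-sites: MNP minus atom position (order reversed for a consistent sign).
    p_b = mean_neighbour_positions(a) - b
    # A-sites: atom position minus MNP; edge atoms lack four neighbours.
    p_a = a[1:-1, 1:-1] - mean_neighbour_positions(b)
    return p_a, p_b


# Ideal unit-spacing lattice with the B sublattice shifted by -0.1 along z.
ii, jj = np.meshgrid(np.arange(4.0), np.arange(5.0), indexing="ij")
a_grid = np.stack([ii, jj], axis=-1)
b_grid = a_grid[:-1, :-1] + 0.5 + np.array([0.0, -0.1])
p_a, p_b = polar_displacements(a_grid, b_grid)
print(p_a[0, 0], p_b[0, 0])  # both point along +z, opposite to the B-site shift
```

With this sign convention a uniform B-site shift gives the same displacement vector on both sublattices, which is the consistency the reversed A-site/B-site ordering is meant to provide.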

A conservative estimation of the experimental error in the polar displacement 
maps for the data set in this work was performed by statistical analysis of the distribution 
of the polar displacements within the SrTiO3 layers. P_PD for the Sr and Ti 
atoms within the SrTiO3 layers (excluding the interfaces) has σ_RMS = 4.88 pm. 
This represents an upper bound on the experimental error, as it is predicated on 
all measured non-centrosymmetric distortions of SrTiO3 being the result of noise. 
Any actual distortions, be they induced by the PbTiO3 epitaxy or resulting from 
defects, are falsely included in this estimate. 

The curl vector (or vorticity) of the displacement vector field P_PD in the TEM 
plane is the y component of ∇ × P_PD, or 

$$(\nabla \times \mathbf{P}_{\mathrm{PD}})_y\,\hat{\mathbf{y}} = \left(\frac{\partial P_{\mathrm{PD},x}}{\partial z} - \frac{\partial P_{\mathrm{PD},z}}{\partial x}\right)\hat{\mathbf{y}}$$

(again the TEM image is defined as the x-z plane, y is the direction of the electron beam). 
The polarization gradient terms ∂P_PD,z/∂x and ∂P_PD,x/∂z were calculated by a 
weighted least-squares fit of 
a two-variable (x and z) first-order polynomial to P_PD,x and P_PD,z at each atom site 
with a fit area extending to 12 surrounding atoms. More succinctly, the local P_PD,x 
and P_PD,z values were fitted as a plane, and the derivatives are taken as the x and z 
slopes of the plane. A Gaussian function with a σ of the mean grid spacing was 
used as the weighting function, that is, the central atom was given a weight of 1 
and the weight decreased to ~0.1 for the most distant of the 12 neighbour atoms 
considered. The polynomial fit was used as a smoothing function for the vector 
fields displayed in the figures, effectively smoothing the first derivative, and the 
gradients input into the curl equation. The curl vectors calculated and smoothed 
in this manner for the HAADF image in Fig. 2 are shown in Extended Data Fig. 9a 
for all atom positions. A comparison between the smoothed and raw polar 
displacements for one of the vortices is shown in Extended Data Fig. 9b and d, 
respectively. 
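The plane-fit estimate of the vorticity can be sketched as follows. This is a simplified illustration rather than the authors' implementation: it assumes the caller has already selected the central atom and its neighbours, uses Gaussian weights exp(−r²/2σ²), and takes the y component of the curl as ∂P_x/∂z − ∂P_z/∂x. Function and variable names are hypothetical.

```python
import numpy as np


def weighted_plane_fit(x, z, values, x0, z0, sigma):
    """Gaussian-weighted least-squares fit of values ≈ c0 + cx*(x-x0) + cz*(z-z0)."""
    dx = np.asarray(x, dtype=float) - x0
    dz = np.asarray(z, dtype=float) - z0
    values = np.asarray(values, dtype=float)
    weights = np.exp(-(dx**2 + dz**2) / (2.0 * sigma**2))  # weight 1 at the centre
    design = np.column_stack([np.ones_like(dx), dx, dz])
    root_w = np.sqrt(weights)
    # Scaling rows by sqrt(w) turns ordinary lstsq into weighted least squares.
    coeffs, *_ = np.linalg.lstsq(design * root_w[:, None], values * root_w, rcond=None)
    return coeffs  # (smoothed value at centre, slope along x, slope along z)


def curl_y(x, z, px, pz, x0, z0, sigma):
    """y component of the curl from the fitted slopes: dPx/dz - dPz/dx."""
    _, _, dpx_dz = weighted_plane_fit(x, z, px, x0, z0, sigma)
    _, dpz_dx, _ = weighted_plane_fit(x, z, pz, x0, z0, sigma)
    return dpx_dz - dpz_dx


# Synthetic rigid rotation P = (omega*z, -omega*x), whose curl is 2*omega.
gx, gz = np.meshgrid(np.arange(-2.0, 3.0), np.arange(-2.0, 3.0))
x, z = gx.ravel(), gz.ravel()
omega = 0.5
vorticity = curl_y(x, z, omega * z, -omega * x, 0.0, 0.0, sigma=1.0)
print(vorticity)  # close to 2 * omega = 1.0
```

The first fitted coefficient is the smoothed field value at the central atom, which corresponds to using the same polynomial fit as the smoothing function for the displayed vector maps.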

Phase-field simulations for superlattice structures. In the phase-field modelling 
of the superlattice films, the evolution of the polarization was obtained by solving 
the time-dependent Ginzburg-Landau (TDGL) equations: 

$$\frac{\partial P_i(\mathbf{r}, t)}{\partial t} = -L\,\frac{\delta F}{\delta P_i(\mathbf{r}, t)} \qquad (1)$$


where L, r, and t denote the kinetic coefficient, spatial position vectors, and time, 
respectively. The contributions to the total free energy F include the Landau bulk 
energy, elastic energy, electric (electrostatic) energy, and gradient energy, that is: 


$$F = \int_V \left(f_{\mathrm{Landau}} + f_{\mathrm{elastic}} + f_{\mathrm{electric}} + f_{\mathrm{gradient}}\right)\mathrm{d}V \qquad (2)$$


The expression of each energy density can be found in the literature. Owing 
to the inhomogeneity of elastic constants in the (SrTiO3)n/(PbTiO3)n superlattices, 
a spectral iterative perturbation method was employed to solve the mechanical 
equilibrium equation to obtain the stress field. The pseudocubic lattice constants 
for PbTiO3 and SrTiO3 were taken as 3.9547 Å and 3.905 Å, respectively, while 
the anisotropic in-plane lattice constants for the substrate DyScO3 were taken from 
the literature to calculate the misfit strain. Material constants for PbTiO3 and SrTiO3 
used in the simulations are found in the literature and these include the Landau 
potentials, elastic constants, electrostrictive coefficients, background dielectric 
constants, and gradient energy coefficients. 

The 3D phase-field simulation of the (SrTiO3)10/(PbTiO3)10 superlattice 
was done using discrete grids of (200Δx) x (200Δy) x (250Δz) with 
Δx = Δy = Δz = 0.4 nm, where Δx, Δy, and Δz are in real space. The thicknesses 
of the substrate, film, and air are (30Δz), (198Δz), and (22Δz), respectively. In the 
film, alternating n grids of PbTiO3 layers and n grids of SrTiO3 layers are incorporated 
to simulate the periodically layered superlattice, where n is the superlattice 
periodicity (for example, n = 10 for the 10 x 10 superlattice). Periodic boundary 
conditions are assumed in both the x and y directions, and a superposition method 
is used along the z direction. Random noise is used as the initial setup to simulate 
the thermal fluctuation during the annealing process. 
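To make the relaxation scheme concrete, the toy sketch below integrates a one-dimensional, scalar time-dependent Ginzburg-Landau equation from random noise. It is deliberately far simpler than the model described here: it keeps only a double-well Landau term and a gradient term, omits the elastic and electrostatic energies entirely, and uses arbitrary dimensionless coefficients (a, b, g, the mobility and the grid are all assumptions made for illustration).

```python
import numpy as np


def free_energy(p, a, b, g, dx):
    """Discrete F = sum(-a/2 p^2 + b/4 p^4 + g/2 (dp/dx)^2) dx, periodic in x."""
    grad = (np.roll(p, -1) - p) / dx
    density = -0.5 * a * p**2 + 0.25 * b * p**4 + 0.5 * g * grad**2
    return float(np.sum(density) * dx)


def tdgl_step(p, a, b, g, dx, dt, mobility):
    """One explicit Euler step of dp/dt = -L dF/dp."""
    laplacian = (np.roll(p, -1) - 2.0 * p + np.roll(p, 1)) / dx**2
    dF_dp = -a * p + b * p**3 - g * laplacian
    return p - dt * mobility * dF_dp


def relax(n_points=128, n_steps=2000, a=1.0, b=1.0, g=1.0,
          dx=1.0, dt=0.1, mobility=1.0, seed=0):
    """Evolve from small random noise; return final field and start/end energies."""
    rng = np.random.default_rng(seed)
    p = 0.01 * rng.standard_normal(n_points)  # random initial state
    f_start = free_energy(p, a, b, g, dx)
    for _ in range(n_steps):
        p = tdgl_step(p, a, b, g, dx, dt, mobility)
    return p, f_start, free_energy(p, a, b, g, dx)


p_final, f_start, f_end = relax()
print(f_start, f_end)  # the free energy decreases during relaxation
```

Even in this reduced form the field evolves without any imposed pattern: starting from noise, it settles into regions near the two Landau minima separated by smooth walls whose width is set by the gradient coefficient.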

Using this approach, the 3D phase-field simulation was carried out for a 
(SrTiO3)10/(PbTiO3)10 superlattice, and the results are shown in Fig. 4. The 
vortex-antivortex pairs extending along DyScO3 [010]pc are highlighted in Fig. 4 by red 
and blue colours having opposite vorticity. The long-range ordering of these vortex 
pairs along DyScO3 [100]pc in a 10 x 10 superlattice can be clearly seen in Extended 
Data Fig. 5c; a subsection shows a vortex—antivortex pair in Extended Data Fig. 5d, 
which is also shown in Fig. 2d. 

A comparison of phase-field models for n = 10 and n = 50 PbTiO3 thickness 
highlights the difference between the vortex and flux-closure domain structure 
(Extended Data Fig. 7). In a true vortex state, polarization is continuously changing 
about a vortex core giving rise to an inhomogeneous polarization distribution. 
Such a polarization state is demonstrated using both HR-STEM displacement 
vector mapping (Extended Data Fig. 7a) and phase-field simulations (Extended 
Data Fig. 7b) for a 10 x 10 superlattice. In a flux-closure domain (Extended Data 
Fig. 7d, e), on the other hand, regions of homogeneous polarization (with a net 
polarization) can be identified, which are separated by domain walls, giving rise 
to a polarization distribution similar to those previously observed in bulk materials 
(ref. 18). In ref. 18, free-standing single crystal lamellae of BaTiO3 were probed using 
piezoresponse force microscopy, and the results show formation of flux-closure 
domains, at the length scale of micrometres, which are very similar to the domain 
pattern reported in ref. 19. 

Phase-field calculations were further employed to complete a phase diagram 
of the evolution of vortex states in these superlattices (that is, total energy density 
as a function of superlattice periodicity; Extended Data Fig. 8). At lower periodicities 
(<10 x 10), calculations predict the existence of ferroelastic domains, shown 
as a1/a2 by solid squares. a1/a2-type domain structures have domains which are 
polarized along the in-plane directions (the directions along which mechanical 
constraints are applied, in this case along [100]pc and [010]pc), and are separated 
by 90° domain walls, making them ferroelastic in nature. After reaching a critical 
periodicity (~10 x 10), total energy calculations show that the vortex structure 
becomes more favourable as compared to the a1/a2. Upon transitioning to larger 
superlattice periods (>27 x 27), the polarization state is stabilized by incorporating 
flux-closure-type domains, where regions of homogeneous polarization exist, and 
are separated by domain walls. 

Sample size. No statistical methods were used to predetermine sample size. 

Extended Data Figure 1 | RHEED-controlled growth of superlattices. 
RHEED oscillations for five periods of a (SrTiO3)6/(PbTiO3)6 superlattice 
grown using a 20% excess Pb target are shown. The oscillations were 
present throughout the growth of the 100-nm-thick superlattice. 
a, RHEED oscillations for six unit cells of PbTiO3, followed by oscillations 
intensity variation for growth of PbTiO3 from a stoichiometric PbTiO3 or 
0% Pb excess target. Inset, RHEED diffraction pattern obtained at the end 
of growth, highlighting the rapid transition to 3D growth mode when a 
stoichiometric target is used (Methods). 

Extended Data Figure 2 | Rutherford backscattering spectrometry 
(RBS) characterization. An RBS spectrum of a 200-nm PbTiO3 film 
grown on a single-crystalline GdScO3 (001)pc substrate is shown, along 
with the simulated spectrum (indicated by ‘Best fit’ in the figure) for the 
same structure, revealing an effective Pb:Ti ratio of 1.02:1, which is very 
close to ideal stoichiometry. 

Extended Data Figure 3 | Fitting atomic positions in the HR-STEM 
images. a, An HR-STEM HAADF image of a (SrTiO3)10/(PbTiO3)10 
superlattice. Polar displacement vectors (P_PD) are calculated by first 
finding atom positions by fits of 2D Gaussians to the image intensity. 
b, An example subregion from an HR-STEM image (unrotated). c, The 
corresponding Gaussian fit of b. d, An example of the B-site-centred 
five-atom clusters which are fitted simultaneously. 

Extended Data Figure 4 | Extracting polar displacement vectors (P_PD) 
from HR-STEM data. a, The Gaussian fit of a region of an HAADF STEM 
image. b, A magnified view of the boxed subregion in a, showing two 
overlapping unit cells, namely, the A-site-centred perovskite unit cell, and 
the B-site-centred perovskite unit cell. c, The mean position of each atom’s 
four nearest cation neighbours (MNP, mean neighbour position) is used 
to determine a relative sublattice offset. d, A polar displacement vector 
(yellow arrow for A-site and red arrow for B-site), taken as the difference 
between each atom (filled circle) and the MNP (‘x’), is given opposite sign 
for A- and B-sites to maintain a consistent direction. 
units for polar displacement are in pm). c, A cross-section of the phase-field 
simulation for the (SrTiO3)10/(PbTiO3)10 superlattice showing the 
Extended Data Figure 5 | Polar displacement vector map and phase-field 
simulation of a (SrTiO3)10/(PbTiO3)10 superlattice. a, The polar 
with interlayer alignment. This sample 
exhibits preference for alignment of matched rotations 


polarization vectors (red arrows 


dio! 


displacement vector map (black arrows) corresponding to the (SrTiO3)10/ 
(PbTiO3)10 superlattice in Fig. 2a is shown overlaid on the colourized 


indicated by 
alignment is also found (indicated 
in the figure). Such defects are found 


but a phase shift of anti- 
rotation boundary 


blue arrows) 


(V x Ppp) (o10). b, A magnified view of the highlighted polar vortex shows 
the smooth rotation increasing towards a vortex core. The extremes of the 
colour scale of the curl vector are —16.75 pmnm~! and 16.75pmnm_ |, 
with blue and red colours indicating negative and positive values of curl 
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occasionally. d, A subregion of the simulation, highlighting a vortex- 


antivortex pair with the curl of the polarization overlaid. 
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Extended Data Figure 6 | Vortex-antivortex arrays along both pseudo- DF-TEM of the same (SrTiO3) ;9/(PbTiO3) 19 superlattice sample taken 
cubic in-plane directions. a, Cross-sectional DF-TEM of a (SrTiO3)10/ along the orthogonal [100], zone axis (inset as a). In both cases, clear 
(PbTiO3) jo superlattice taken along the [010], zone axis (inset shows the vortex—antivortex structures are observed. 


SAED pattern and the g-vector for image formation). b, Cross-sectional 
Extended Data Figure 7 | Differentiation of vortex structure from a
flux-closure domain structure using phase-field simulations and
HR-STEM polar displacement mapping. a, HR-STEM polar
displacement vector map of a single vortex in a (SrTiO3)10/(PbTiO3)10
superlattice. b, Phase-field-calculated polarization vector map of a single
vortex in a (SrTiO3)10/(PbTiO3)10 superlattice. c, Total energy density
variation within a single vortex in a (SrTiO3)10/(PbTiO3)10 superlattice
calculated from phase-field simulations. d, HR-STEM polar displacement
vector map of a 20-nm PbTiO3 layer in a heterostructure having the
same layer stacking as in ref. 19. e, Phase-field-calculated polarization
vector map of a flux-closure domain in a 20-nm PbTiO3 heterostructure
(red, blue, green and pink arrows in d and e indicate the direction of net
polarization within their respective domains). f, Total energy density
variation within a flux-closure domain calculated using phase-field
simulations.

Extended Data Figure 8 | Phase evolution of vortex structure. The
phase-field-calculated total energy for various phases competing with a
vortex phase is depicted. See Methods for details.

Extended Data Figure 9 | HR-STEM displacement vector map of a
(SrTiO3)10/(PbTiO3)10 superlattice. a, Polar displacement vector map of
a (SrTiO3)10/(PbTiO3)10 superlattice obtained from the same HR-STEM
data set as Fig. 2a, showing the polar displacement vectors at all atom
positions. b, A magnified image of a single vortex. c, Polar displacement
vector map of the same region as b, but plotting only Pb (A-site) centred
displacements (like Fig. 2a). d, Polar displacement vector map of a single
vortex showing the raw displacement vectors.

Macromolecular diffractive imaging using imperfect crystals


Kartik Ayyer, Oleksandr M. Yefanov, Dominik Oberthür, Shatabdi Roy-Chowdhury, Lorenzo Galli, Valerio Mariani,
Shibom Basu, Jesse Coe, Chelsie E. Conrad, Raimund Fromme, Alexander Schaffer, Katerina Dörner,
Daniel James, Christopher Kupitz, Markus Metz, Garrett Nelson, Paulraj Lourdu Xavier, Kenneth R. Beyerlein,
Marius Schmidt, Iosifina Sarrou, John C. H. Spence, Uwe Weierstall, Thomas A. White, Jay-How Yang, Yun Zhao,
Mengning Liang, Andrew Aquila, Mark S. Hunter, Joseph S. Robinson, Jason E. Koglin, Sébastien Boutet, Petra Fromme,
Anton Barty & Henry N. Chapman


The three-dimensional structures of macromolecules and their
complexes are mainly elucidated by X-ray protein crystallography.
A major limitation of this method is access to high-quality
crystals, which is necessary to ensure X-ray diffraction extends to
sufficiently large scattering angles and hence yields information
of sufficiently high resolution with which to solve the crystal
structure. The observation that crystals with reduced unit-cell
volumes and tighter macromolecular packing often produce
higher-resolution Bragg peaks suggests that crystallographic
resolution for some macromolecules may be limited not by their
heterogeneity, but by a deviation from strict positional ordering of the
crystalline lattice. Such displacements of molecules from the ideal
lattice give rise to a continuous diffraction pattern that is equal to
the incoherent sum of diffraction from rigid individual molecular
complexes aligned along several discrete crystallographic
orientations and that, consequently, contains more information
than Bragg peaks alone. Although such continuous diffraction
patterns have long been observed (and are of interest as a source of
information about the dynamics of proteins), they have not been
used for structure determination. Here we show for crystals of the
integral membrane protein complex photosystem II that lattice
disorder increases the information content and the resolution
of the diffraction pattern well beyond the 4.5-ångström limit of
measurable Bragg peaks, which allows us to phase the pattern
directly. Using the molecular envelope conventionally determined
at 4.5 ångströms as a constraint, we obtain a static image of the
photosystem II dimer at a resolution of 3.5 ångströms. This
result shows that continuous diffraction can be used to overcome
what have long been supposed to be the resolution limits of
macromolecular crystallography, using a method that exploits
commonly encountered imperfect crystals and enables model-free
phasing.

High-resolution Bragg diffraction from a crystal requires a high
degree of regularity, which is often not obtained in practice. If one
of the structural units that make up a crystal (such as a molecule) is
displaced from the ideal lattice by an amount s, then the phase of the
diffracted wave from this unit is changed by 2πs/d at a scattering angle
2θ corresponding to the resolution d, leading to destructive interference
at that resolution when s = d/2. If all units are displaced randomly
with a root-mean-square (r.m.s.) displacement σ along a given
coordinate, then Bragg intensities will accordingly diminish at a
resolution of d = 2πσ according to the well known Debye-Waller factor
exp(−4π²q²σ²), where q = 1/d = 2 sin θ/λ for an X-ray wavelength λ
(ref. 8). The disorder length σ is easily estimated from a so-called
Wilson plot of the diffraction data. Less widely appreciated is the fact
that although the Bragg peaks diminish exponentially with the square
of scattering angle, an incoherent sum of the diffraction of individual
structural units arises contrariwise (see Fig. 1 and Methods
section 'The effect of rigid-body disorder on Bragg and continuous
diffraction') to conserve the energy lost from the Bragg peaks. This
incoherent sum is equal to the Fraunhofer diffraction pattern of the
structural unit multiplied by the number of units, without any
modulation by Bragg peaks.
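
The relations in the paragraph above can be checked numerically. The following is a minimal sketch, not the authors' code: the function names are illustrative, and the complementary weight 1 − exp(−4π²q²σ²) for the incoherent sum is an assumption that follows from the energy-conservation argument rather than a formula quoted from this part of the text.

```python
import math


def debye_waller(q, sigma):
    """Bragg attenuation exp(-4 pi^2 q^2 sigma^2); sigma is the per-coordinate r.m.s. displacement."""
    return math.exp(-4.0 * math.pi ** 2 * q ** 2 * sigma ** 2)


def continuous_weight(q, sigma):
    """Assumed complementary weight of the incoherent (continuous) term."""
    return 1.0 - debye_waller(q, sigma)


def sigma_from_bragg_limit(d_limit):
    """Per-coordinate r.m.s. displacement implied by the relation d = 2 pi sigma."""
    return d_limit / (2.0 * math.pi)


if __name__ == "__main__":
    sigma = sigma_from_bragg_limit(5.0)  # example: a Bragg limit of 5 angstroms
    print(f"sigma per coordinate: {sigma:.2f} A, 3D r.m.s.: {sigma * math.sqrt(3):.2f} A")
    for d in (20.0, 10.0, 5.0, 3.5):
        q = 1.0 / d
        print(f"d = {d:4.1f} A  Bragg factor = {debye_waller(q, sigma):.3f}  "
              f"continuous weight = {continuous_weight(q, sigma):.3f}")
```

At the resolution d = 2πσ the Bragg factor has fallen to 1/e, which is the sense in which the Bragg intensities "diminish" there.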

Continuously modulated diffraction intensities have frequently
been observed in diffraction patterns of protein crystals, and
have even been ascribed to the disorder in the arrangement of rigid
structural units as large as single macromolecules. Although
the continuous diffraction was seen to follow the point-group
symmetry of the crystal, previous studies did not consider the use of
these patterns for structure determination. Protein crystals usually
contain several molecules or molecular assemblies per unit cell as
well as a solvent. The packing of molecules is mediated by
non-covalent contacts between them (H bonds as well as electrostatic
and hydrophobic interactions). It is therefore reasonable to propose
a model in which the macromolecule is the rigid structural unit. In
this case, with uncorrelated molecular displacements, the continuous
diffraction pattern will consist of the incoherent sum of the
continuous diffraction patterns of the macromolecule aligned in each of
its various orientations within the crystal (given by equation (5) in
Methods).

Accessing the 'single molecule' continuous diffraction overcomes
the phase problem of crystallography, by increasing the number of
observable coefficients to exceed the number required to describe
the molecule, independent of resolution (see Methods section
'Constraint ratio'). Bragg intensities can be equivalently represented
in real space, by Fourier transformation, as the three-dimensional
(3D) pair-correlation function (or Patterson function) of the crystal.
This correlation function cannot always distinguish between object
pairs existing only in one unit cell or straddling neighbours, but such
an ambiguity is removed if the object is non-repeating. The highly
sampled single-molecule diffraction data of an imperfect crystal
(averaged over the orientational symmetries of the point group) could be
used to obtain a real-space image of the macromolecule at a resolution
beyond the resolution of the Bragg peaks, using iterative phasing
algorithms of coherent diffractive imaging and aligned molecule
diffraction.
Figure 1 | Lattice disorder reveals the continuous molecular transform.
a, b, Diffraction from an arrangement of objects in a perfect lattice (a)
results in regularly spaced Bragg peaks (b) as a result of constructive
interference. c, d, Translational disorder of the objects at a length scale
σ (c) disrupts Bragg interference beyond a reciprocal-lattice resolution
length of d = 1/q = 2πσ (d). The loss of correlation gives rise to the
incoherent sum of the Fraunhofer diffraction patterns of individual
objects, which increases with q in balance with the diminishing Bragg
intensities. Here, σ was chosen to be 4% of the lattice spacing, leading to
reduction of Bragg peaks at the fourth order.

Measurements from crystals of the large integral membrane protein
complex photosystem II (PSII) support the notion that the inherent
resolution of the single asymmetric unit in the crystal can be
better than that of the entire crystal. Recent X-ray free-electron laser
(XFEL) experiments yielded Bragg diffraction from crystals with
volumes of 1 µm³ to a resolution of 5 Å (ref. 23). The protein-cofactor
complex consists of a dimer of approximately 700 kDa. There are
only four crystal contacts per dimer, and 64% solvent by volume
in the crystal. If ascribed to disorder of the lattice alone, the Bragg
limit of 5 Å could be caused by an r.m.s. displacement of molecules by
0.8 Å in each dimension (1.4-Å r.m.s. 3D displacement; see Methods
section 'The effect of rigid-body disorder on Bragg and continuous
diffraction'). Bragg diffraction from large PSII crystals (1 mm³
in volume) extending to 1.9 Å was recently reported, achieved by
dehydrating the crystal in the presence of small amphiphiles, leading
to an increase in crystal contacts and a tighter crystal packing. We
set out to determine whether the limited crystallographic resolution
of 5 Å is caused by random displacements of rigid-body structural
units.

Still snapshot diffraction patterns, such as those shown in Fig. 2a,
were recorded from room-temperature microcrystals of PSII in random
orientations, as per the method of serial femtosecond crystallography
(see Methods section 'Crystallography'). The orientation of
each individual crystal was determined from its diffraction pattern by
indexing the Bragg peaks, allowing it to be inserted into a 3D array to
build up the full intensity distribution in the crystal coordinate frame
(Fig. 2c, d). The accumulation of continuously modulated features, or
'speckles', in the diffraction volume implies that they are associated
with the crystal rather than other scattering sources. Further,
consistent with the hypothesis of rigid-body random displacements, the
features are absent at the lowest resolutions and become dominant
at scattering angles beyond the highest-angle Bragg peaks. The size
of the structural unit can be estimated in an unbiased fashion from
the autocorrelation of the diffracting object (the 3D pair-distribution
function), which is obtained by a Fourier transform of the continuous
diffraction intensities. We found it to be in agreement with the size
of a PSII dimer, as shown in Fig. 3.

One can consider several ways to obtain a high-resolution structure
from the data shown in Fig. 2. We found that a particularly robust
method was to treat the Bragg and continuous diffraction as two
distinct sources of data of the same structure, the first arising from a
coherent interference of molecules in the unit cell, and the second due
to the incoherent addition of aligned single-molecule diffraction (see
Methods section 'Two data sources from one experiment'). Starting
with a known search model for PSII, we first produced an electron
density map of the PSII dimer by molecular replacement phasing to a
resolution of 4.5 Å, limited by the angular extent of the Bragg peaks.
This map was then used to generate a low-resolution binary mask of
the smoothed molecular envelope of a single PSII dimer, shown in
Fig. 3d. This mask forms the real-space constraint used to generate a
3D image of the electron density of the PSII dimer by iterative phasing
of the continuous diffraction.

Figure 2 | Molecular coherent diffraction. a, An XFEL snapshot 'still'
diffraction pattern of a PSII microcrystal shows a weak speckle structure
beyond the extent of Bragg peaks, which is enhanced in this figure
by limiting the displayed pixel values. b, Structure factors obtained
from Bragg-peak counts from 25,585 still patterns, displayed as a
precession-style pattern of the [001] zone axis. c, A rendering of the
entire 3D diffraction volume assembled from the 2,848 strongest
patterns. d, A central section of the diffraction volume in c normal to the
[100] axis. Speckles are clearly observed beyond the 4.5-Å extent of Bragg
diffraction (indicated by the white circles in b and d) to the edge of the
detector.

The iterative phasing of the continuous diffraction, covering a
resolution range of 4.5–3.3 Å, was carried out using the difference-map
algorithm (Methods section 'Iterative phasing'). As with other
iterative phasing algorithms, an initial guess of the electron density
obtained from random phases is constrained on each iteration to be
consistent with the measured diffraction and to fit within a certain
finite (but not necessarily precisely known) real-space extent called the
'support', in this case the smoothed envelope generated from the
4.5-Å-resolution electron density map.

After convergence of the iterate, the Fourier amplitudes and phases
over the entire resolution range were combined to synthesize a 3.3-Å
resolution structure. Averaging solutions obtained over multiple random
starts produced a self-consistent electron density to a resolution
of 3.5 Å, as determined by the Fourier shell correlation and the phase
retrieval transfer function (Fig. 3e). Finally, pseudo-crystallographic
refinement of this electron density was carried out, following a similar
procedure as for single-particle cryo-electron microscopy data
(Methods section 'Model refinement'). Regions of the final electron
density map are noticeably superior to the same regions generated
using the Bragg peaks alone (Fig. 4). In particular, helices show better
definition in side-chains, and the model better follows the electron
density. The benefit of including the continuous diffraction
data is quantified in the improvement of the R_free factor at low
resolution (Extended Data Fig. 1). Although the assembled continuous
diffraction data are quite noisy, owing to the limited number (2,848)
of diffraction patterns, the structure clearly benefits from their
inclusion. Control experiments verified these benefits (see Methods section
'Control analyses and comparisons'). Further improvements could be
expected with more measurements.
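
The iterative phasing step described above alternates between two constraints: agreement with the measured Fourier moduli and confinement to the support. The following one-dimensional toy is a hedged sketch only. It uses the common β = 1 form of the difference-map update, x ← x + P_S(2P_M(x) − x) − P_M(x), which is an assumption because the parameters used in the paper are not given in this section, and all function names are illustrative.

```python
import cmath
import random


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform of a 1D sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out


def project_modulus(x, measured):
    """P_M: keep the current Fourier phases, impose the measured moduli."""
    spectrum = dft(x)
    fixed = [m * cmath.exp(1j * cmath.phase(v)) for v, m in zip(spectrum, measured)]
    return dft(fixed, inverse=True)


def project_support(x, support):
    """P_S: set the density to zero outside the support mask."""
    return [v if inside else 0.0 for v, inside in zip(x, support)]


def difference_map_step(x, measured, support):
    """One beta = 1 difference-map update."""
    pm = project_modulus(x, measured)
    reflected = [2 * a - b for a, b in zip(pm, x)]
    ps = project_support(reflected, support)
    return [xi + s - m for xi, s, m in zip(x, ps, pm)]


if __name__ == "__main__":
    n = 16
    rho = [0.0] * n
    rho[3:8] = [1.0, 2.0, 3.0, 2.0, 1.0]          # toy "molecule"
    support = [2 <= i < 9 for i in range(n)]      # loose envelope around it
    measured = [abs(v) for v in dft(rho)]         # stands in for measured moduli
    rng = random.Random(0)
    start = [m * cmath.exp(2j * cmath.pi * rng.random()) for m in measured]
    x = dft(start, inverse=True)                  # random-phase initial guess
    for _ in range(100):
        x = difference_map_step(x, measured, support)
    estimate = project_modulus(x, measured)
    leak = sum(abs(v) for v, inside in zip(estimate, support) if not inside)
    print(f"density left outside the support after 100 iterations: {leak:.3e}")
```

Any density that satisfies both constraints is a fixed point of the update, which is the property the iteration relies on; convergence from a random start is not guaranteed and is typically monitored over several starts.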

Our approach, demonstrated with serial diffraction data collected 
using an XFEL, may also be applicable to larger crystals measured with 
conventional X-ray sources. The advantages of data collection with an 
XFEL include a higher tolerable dose and the capacity to measure data 
from hundreds of thousands of crystals, from which the most suitable 
can be selected solely on the basis of their diffraction patterns. The 
improved structural refinement at a resolution beyond that of meas- 
urable Bragg peaks establishes the origin of the continuous diffraction 
and demonstrates its utility in obtaining substantial increases in res- 
olution without having to improve crystal lattice order. The crystal is 
simply a means to obtain an ensemble of oriented macromolecules to 
carry out aligned-molecule diffractive imaging. We anticipate that
the approach will be applicable to a range of macromolecular systems, 
and may allow ab initio phasing of crystals. After many decades of 
being discarded as of little use, all-too-common ‘poorly diffracting’ 
protein crystals might now be exploited for high-resolution structure 
determination. 
Figure 3 | An unbiased size estimate of the rigid structural unit.
This estimate is obtained by a Fourier transform of the continuous
diffraction intensities, yielding the autocorrelation function (3D
pair-distribution function). a, Projection of the experimentally determined 3D
autocorrelation along the crystal c axis. b, c, The equivalent projections of
the autocorrelation functions calculated from the 4.5-Å model of the PSII
dimer (b) and the monomer (c) after applying the point-group symmetries
of the crystal. The extent of the rigid structural unit matches the size and
shape of the PSII dimer. d, A loose support generated by thresholding and
dilating the 4.5-Å-resolution structure was used as the support constraint
for iterative phasing. e, Plots of the Fourier shell correlation (FSC; blue)
and phase retrieval transfer function (PRTF; red). A value of PRTF = 0.5
is reached at a resolution of 3.5 Å. The dashed line corresponds to the
boundary between the phased Bragg and the continuous diffraction.


Figure 4 | Improvement in resolution and quality of electron density.
This improvement is obtained by directly phasing the continuous
transform using the method of coherent diffractive imaging. a, Electron
density map of the PSII dimer after refinement using structure factors
obtained only from Bragg peaks at 4.5-Å resolution, contoured at one
standard deviation of the densities (1σ). b, Electron density map obtained
by iterative phase retrieval on the continuous diffraction data using the
support constraint (molecular envelope) of Fig. 3d at 3.5-Å resolution,
contoured at 1σ. c, d, Electron density maps of two regions of the PSII
dimer, using only the Bragg diffraction (models shaded green), and
the Bragg and continuous diffraction (models shaded blue). c, Two
antenna chlorophylls in the antenna protein PsbC with their His residues,
contoured at 1.5σ. d, The haem group of PsbE/F, contoured at 1.25σ.

Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.

METHODS


Crystallography. Microcrystals were grown by a free interface diffusion
centrifugation technique, as described previously. The size of the crystals
ranged from 0.5 µm to 5 µm in width, which was determined by dynamic light
scattering. The space group of the crystals is P2₁2₁2₁ (ref. 23).

The experiments were carried out at the Coherent X-ray Imaging (CXI) end
station of the Linac Coherent Light Source (LCLS). X-ray pulses with
durations of 40 fs and photon energy of 9.48 keV were focused using X-ray optics in
a Kirkpatrick-Baez geometry to a beam size of about 1 × 1 µm². A suspension
of PSII crystals in their crystallization buffer was pre-filtered using 20-µm
stainless-steel filters and delivered at room temperature to the X-ray interaction
region as a 3–4-µm-diameter jet at a flow rate of 10–15 µl min⁻¹, using a gas
dynamic virtual nozzle (GDVN) as previously described. The average X-ray
pulse energy at the sample was about 1.0 mJ, assuming a beamline transmission of
60%. The dose to each crystal was at most 275 MGy, and diffraction patterns were
acquired on the basis of 'diffraction before destruction'. Diffraction frames were
collected on a Cornell-SLAC Pixel Array Detector (CSPAD) at a rate of one frame
per X-ray pulse, at 120 Hz (ref. 37). In total, 1.24 million frames were collected
during an integrated collection time of 2.9 h.

The diffraction frames were initially processed following a standard data
evaluation pipeline established for serial femtosecond crystallography. Frames
containing crystal diffraction were found using Cheetah on the basis of the
identification of Bragg peaks. These 'hits', which accounted for about 5.0% of the
frames, were transferred to CrystFEL (version 0.6.0) along with the peak positions
for automated indexing and to obtain a merged set of integrated Bragg intensities.
Indexing the lattice determines the crystal orientation and crystal lattice
parameters for each diffraction pattern, information that was later used to
assemble the full 3D pattern shown in Fig. 2c. The relative coordinates of all
pixels in the segmented CSPAD detector were determined by a procedure using a
strongly diffracting standard (lysozyme). Using these calibrated pixel positions,
a total of 25,585 patterns out of 61,946 were successfully indexed to create a
final list of Bragg reflections and their integrated intensities. The resultant
data set of Bragg intensities extended to a resolution of 4.5 Å, as indicated by
the modified Pearson correlation coefficient
CC* (ref. 41) (Extended Data Fig. 2).
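
As a quick consistency check on the numbers quoted in the two paragraphs above, the following sketch recomputes the collection time, hit rate and indexing rate from the stated frame counts. The function and key names are illustrative; only the input numbers come from the text.

```python
def collection_summary(frames_total=1_240_000, rate_hz=120, hits=61_946, indexed=25_585):
    """Derive collection time and rates from the frame counts quoted in Methods."""
    return {
        "hours": frames_total / rate_hz / 3600.0,   # integrated collection time
        "hit_rate": hits / frames_total,            # fraction of frames with crystal diffraction
        "index_rate": indexed / hits,               # fraction of hits successfully indexed
    }


if __name__ == "__main__":
    summary = collection_summary()
    print(f"collection time: {summary['hours']:.2f} h")
    print(f"hit rate:        {100 * summary['hit_rate']:.1f} %")
    print(f"indexing rate:   {100 * summary['index_rate']:.1f} %")
```

The derived values agree with the quoted 2.9 h and roughly 5.0% hit rate, and show that about four in ten hits were indexed.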

The crystal orientations obtained by indexing the lattice were subsequently used
to assemble the individual diffraction-pattern snapshots into a 3D reciprocal-space
grid representing the full-pattern intensity distribution of the average disordered
crystal. The volume consisted of 501 × 501 × 501 cubic voxels, each with a width
of 1.364 × 10⁻³ Å⁻¹ (where q = 1/d). Thus, the magnitude of the wave-vector
transfer q at the face centre of the voxel array (250 voxels from the origin) was
0.341 Å⁻¹, corresponding to a resolution of 2.93 Å. Most of the individual snapshot
patterns did not show noticeable continuous diffraction above a substantial
featureless background caused by scattering from the water. The strength of the
background depends linearly on the thickness of the liquid jet, and was removed by
subtracting the average computed by summing counts along circles of constant q
after correcting for the linear polarization of the X-ray beam. This radial average
excluded very high and very low values to avoid the influence of Bragg peaks. The
strength of the remaining crystal-dependent continuous diffraction signal for each
frame was estimated by first replacing Bragg peak values with a local fit of the
slowly varying continuous signal. This was achieved by using a thresholded median
filter: pixel values that differed from a 5 × 5-pixel median-filtered copy of the
pattern by more than one standard deviation above the mean were selected as bright
pixels. The values of these, and pixels within a 3-pixel distance of them, were
replaced with the values from the 5 × 5-pixel median-filtered copy. The strength of
the continuous diffraction signal for each frame was best estimated from pixels in
the range of 0.1–0.2 Å⁻¹ after excluding the Bragg signal, and is plotted as a
function of the total integrated Bragg-peak signal in Extended Data Fig. 3 for all
of the 25,585 indexed patterns. The plot shows a strong linear correlation between
these two signals. The incident intensity for each snapshot diffraction pattern
fluctuates, owing to the random targeting of crystals to the X-ray focus profile
and their variation in size. The trend shows that increased intensity or crystal
size leads to an increase in both the Bragg and continuous diffraction. A single
linear trend indicates little variation in the degree of lattice disorder of the
crystals. We found that selecting the 2,848 patterns with the strongest continuous
diffraction to create the full 3D pattern gave higher signal-to-noise and
continuous-scattering contrast at high resolution than
including all 25,585 patterns.
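
The thresholded median filter described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the threshold is read as "difference from the median-filtered copy exceeds the mean difference by more than one standard deviation", the 3-pixel neighbourhood is taken as Euclidean, windows are clipped at the image edges, and all names are hypothetical. It is written in plain Python for clarity; a production version would use an array library.

```python
import statistics


def median_filter(img, size=5):
    """size x size median filter; windows are clipped at the image edges."""
    h, w, r = len(img), len(img[0]), size // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - r), min(h, y + r + 1))
                      for i in range(max(0, x - r), min(w, x + r + 1))]
            out[y][x] = statistics.median(window)
    return out


def remove_bragg_peaks(img, size=5, n_sigma=1.0, grow=3):
    """Replace bright pixels, and pixels within `grow` of them, by the median-filtered values."""
    med = median_filter(img, size)
    h, w = len(img), len(img[0])
    diff = [img[y][x] - med[y][x] for y in range(h) for x in range(w)]
    threshold = statistics.fmean(diff) + n_sigma * statistics.pstdev(diff)
    bright = [(y, x) for y in range(h) for x in range(w)
              if img[y][x] - med[y][x] > threshold]
    out = [row[:] for row in img]
    for by, bx in bright:
        for y in range(max(0, by - grow), min(h, by + grow + 1)):
            for x in range(max(0, bx - grow), min(w, bx + grow + 1)):
                if (y - by) ** 2 + (x - bx) ** 2 <= grow ** 2:
                    out[y][x] = med[y][x]
    return out


if __name__ == "__main__":
    frame = [[10.0] * 15 for _ in range(15)]   # flat toy background
    frame[7][7] = 1000.0                        # one Bragg-like spike
    cleaned = remove_bragg_peaks(frame)
    print("peak pixel after filtering:", cleaned[7][7])
```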

The effect of rigid-body disorder on Bragg and continuous diffraction. A
deviation from strict translational correlation in a crystal gives rise to a
component of the diffraction pattern that is continuous, which is usually studied
in the context of protein dynamics. The time-averaged diffraction of a moving
object can be considered as the incoherent sum of static snapshots over the time
of the exposure, and, for uncorrelated motions of structural units in a crystal,
the time-averaged diffraction is equal to any particular snapshot in the limit of
a large number of unit cells. Thus, time-integrated diffraction measurements
cannot distinguish static disorder from dynamic. In the case of XFEL exposures,
the positions of structural units such as entire macromolecules are certainly
static. Following the approach of ref. 42, we first derive the diffraction of a
translationally disordered crystal containing only one rigid object per unit cell
(that is, P1 symmetry). The density of this crystal in terms of the real-space
coordinate x, ρ_c(x), can be described by

$$\rho_c(\mathbf{x}) = \rho_a(\mathbf{x}) \otimes \sum_{n=1}^{N} \delta(\mathbf{x} - \mathbf{a}_n - \boldsymbol{\Delta}_n) \qquad (1)$$

where ρ_a(x) is the density of the asymmetric unit, repeated over N positions in the
crystal given by random displacement vectors Δ_n from ideal lattice positions a_n.
In equation (1), ⊗ represents a convolution and δ(x) is the Dirac delta function.
Without loss of generality, the mean displacement ⟨Δ_n⟩ = 0, and ⟨Δ_n²⟩ = σ².
The 3D diffraction intensity distribution is the squared modulus of the Fourier
transform of the crystal electron density ρ_c(x). This can also be obtained by
calculating the Fourier transform of the electron-density autocorrelation function
(the Patterson function).

The double sum in equation (2) can be understood for large crystals in the
following manner. A constant lattice difference vector a_n − a_k is obtained for
many pairs of n and k, as illustrated in Extended Data Fig. 4. For this
difference, the sum samples all possible displacement differences, Δ_n − Δ_k,
except when n = k. Thus, the delta function at that lattice difference vector is
broadened by the probability distribution of the displacement difference. If all
unit cells are displaced independently, this probability distribution of the
difference vectors will be the autocorrelation of the probability distribution of
a single displacement. For a Gaussian with a standard deviation of σ, this is just
another Gaussian with a standard deviation of √2σ. Thus

Keeping in mind that the Fourier transform (F) maps convolutions to
multiplications, the intensity can be calculated from equations (2) and (3) as

where F_a(q) is the (complex-valued) Fourier transform of ρ_a(x) at a resolution of
|q| = 1/d. Equation (4) shows that the diffraction pattern of a disordered crystal
with a single asymmetric unit consists of the squared modulus of the molecular
transform F_a(q), modulated by two terms. The second term is the reciprocal
lattice, which gives the Bragg peaks, further modulated by the Debye-Waller factor,
exp(−4π²σ²q²). The first term has no lattice associated with it, but is a
monotonically increasing function with a value of 0 at q = 0 that asymptotes to a
constant for q > 1/(2πσ). Thus, at high q values, the diffraction is proportional
to the continuous diffraction pattern of the rigid unit, |F_a(q)|².
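
For orientation, one expression that matches the verbal description of equation (4) is given below. It is a reconstruction from equation (1) and from the Gaussian broadening of width √2σ described above, not a quotation of the published equation; the prefactor conventions in particular are an assumption.

```latex
% Reconstruction consistent with the description of equation (4); not quoted from the source.
% First term: continuous diffraction, zero at q = 0 and saturating for q > 1/(2\pi\sigma).
% Second term: reciprocal-lattice (Bragg) sum attenuated by the Debye-Waller factor.
I(\mathbf{q}) \;\propto\; |F_a(\mathbf{q})|^2
\left\{
  N\left[1 - e^{-4\pi^2\sigma^2 q^2}\right]
  \;+\;
  e^{-4\pi^2\sigma^2 q^2}
  \left|\sum_{n=1}^{N} e^{2\pi i\,\mathbf{q}\cdot\mathbf{a}_n}\right|^2
\right\}
```

The form follows because the autocorrelation of the displaced lattice is Nδ(x) for n = k plus Gaussian-broadened peaks for n ≠ k, and the Fourier transform of a unit-area Gaussian of standard deviation √2σ is exp(−4π²σ²q²).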

Generalizing further to the case of M asymmetric units in a unit cell, one can
write the unit cell electron density as

$$\sum_{m=1}^{M} \rho_a(R_m \mathbf{x} + \mathbf{t}_m)$$

where R_m and t_m are the rotation and translation operators for the mth asymmetric
unit. As in equation (1), the electron density for a disordered crystal is

where Δ_{n,m} is the random displacement of the mth asymmetric unit in the nth
unit cell.

The Patterson function of ρ_c(x) can be split into two terms, the first a sum of
like terms connecting corresponding asymmetric units in different unit cells and
the second consisting of the cross terms. Thus

For the first term in this equation, the convolution of the two delta function sums
can be evaluated in a similar way to that of the case of the single asymmetric unit
in equation (3). This yields a term Nδ(x) for n = k and the blurred delta functions
for all other cases. However, for the cross term, because different asymmetric units
are assumed to move independently, this convolution does not have the Nδ(x) term.
Taking the inverse Fourier transform of P(x) and simplifying yields

On gathering the Bragg terms and factorising, the intensity distribution
simplifies to

This equation has a similar form to equation (4), in that there are two terms,
representing the continuous and Bragg diffraction. The difference is that each
term is modulated by different combinations of the asymmetric unit transform.
The continuous diffraction is the sum of the intensities of each asymmetric unit
transform, whereas the Bragg peaks depend on the coherent sum of the asymmetric
unit transforms (the unit-cell transform). This means that only the Bragg peaks
are affected by the systematic absences of the space group, whereas the continuous
diffraction depends only on the point group of the crystal.
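
The distinction between the coherent and incoherent sums can be illustrated with a deliberately simple one-dimensional toy, which is an assumption made for illustration and not the PSII geometry: two identical point-like units per cell, at 0 and at half the cell length, mimicking the translation component of a twofold screw axis. The function names are hypothetical.

```python
import cmath


def unit_transform(q, position, amplitude=1.0):
    """Transform of a point-like unit of given amplitude located at `position`."""
    return amplitude * cmath.exp(2j * cmath.pi * q * position)


def bragg_term(q, positions):
    """Coherent sum over the units in the cell: |sum_m F_m(q)|^2."""
    return abs(sum(unit_transform(q, x) for x in positions)) ** 2


def continuous_term(q, positions):
    """Incoherent sum over the units in the cell: sum_m |F_m(q)|^2."""
    return sum(abs(unit_transform(q, x)) ** 2 for x in positions)


if __name__ == "__main__":
    cell = 1.0
    positions = [0.0, cell / 2]          # two units related by a half-cell translation
    for h in range(1, 5):
        q = h / cell                      # reciprocal-lattice points
        print(f"h = {h}: Bragg term = {bragg_term(q, positions):.3f}, "
              f"continuous term = {continuous_term(q, positions):.3f}")
```

Odd orders vanish in the coherent (Bragg) term, a systematic absence, while the incoherent term is the same at every order.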

This derivation started with the assumption of normally distributed
displacements of rigid-body asymmetric units. The conclusions hold for other
displacement distributions as well, whereby the term exp(−4π²σ²q²) is replaced by
the Fourier transform of the autocorrelation of the probability distribution. For a
smooth (band-limited) distribution, the Fourier transform will reduce to zero
at high q, and the continuous diffraction term will dominate. Furthermore, the
derivation can be generalized to a number of non-identical rigid structural units
in the unit cell.
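
The statement about other displacement distributions can be checked numerically. The Fourier transform of the autocorrelation of a probability distribution equals the squared modulus of its characteristic function, so the Bragg attenuation can be estimated directly from sampled displacements. The sketch below is illustrative only: the sample sizes, the seed and the uniform comparison case are assumptions, not taken from the paper.

```python
import cmath
import math
import random


def bragg_factor_from_samples(q, displacements):
    """Estimate |<exp(2 pi i q d)>|^2 from sampled one-dimensional displacements."""
    mean_phasor = sum(cmath.exp(2j * math.pi * q * d) for d in displacements) / len(displacements)
    return abs(mean_phasor) ** 2


def gaussian_bragg_factor(q, sigma):
    """Closed form for Gaussian displacements: the Debye-Waller factor."""
    return math.exp(-4.0 * math.pi ** 2 * sigma ** 2 * q ** 2)


def uniform_bragg_factor(q, width):
    """Closed form for displacements uniform over an interval of the given width."""
    arg = math.pi * q * width
    return 1.0 if arg == 0 else (math.sin(arg) / arg) ** 2


if __name__ == "__main__":
    rng = random.Random(0)
    gauss = [rng.gauss(0.0, 0.8) for _ in range(50_000)]
    flat = [rng.uniform(-1.0, 1.0) for _ in range(50_000)]
    for q in (0.05, 0.1, 0.2, 0.4):
        print(f"q = {q:4.2f}  Gaussian: sampled {bragg_factor_from_samples(q, gauss):.3f} "
              f"vs {gaussian_bragg_factor(q, 0.8):.3f}   "
              f"uniform: sampled {bragg_factor_from_samples(q, flat):.3f} "
              f"vs {uniform_bragg_factor(q, 2.0):.3f}")
```

In both cases the attenuation falls towards zero at high q, so the continuous term dominates there regardless of the detailed shape of the distribution.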

Two data sources from one experiment. Structure determination could be
considered in terms of a whole-pattern analysis, possibly based on equation (5). An
alternative used here was to treat the Bragg diffraction and continuous diffraction
as two separate data sources that both encode structural information of the
macromolecular system. The observable Bragg diffraction covers the range
0.03–0.22 Å⁻¹ (4.5-Å resolution). The continuous diffraction overlaps this range,
but extends to a resolution of about 2.2 Å (seen in the corners of Fig. 2d).
However, to avoid the influence of the large pixel values of strong Bragg peaks
when separating the continuous diffraction, we consider just the range
0.22–0.35 Å⁻¹ for this data set.
Our strategy for using both forms of diffraction data was to use the continuous
diffraction to extend the resolution of a structure determined from the Bragg data
alone and to add strong phasing constraints by directly accessing the molecular
transform of the rigid structural unit.

Given equation (5), which predicts that the continuous diffraction is propor- 
tional to the incoherent sum of transforms of single rigid units aligned in their 
various crystallographic orientations, we expect the statistics of the continuous 
diffraction intensities to follow that of molecular diffraction, which should be 
exponentially distributed. This can be seen in Extended Data Fig. 5, which shows
a linear trend in the logarithmic histogram above a background level of around 
one count per voxel. 
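
The expected statistics can be illustrated with a toy model in which the diffracted amplitude is a sum of many unit phasors with random phases, which is a standard model for fully developed speckle and an assumption here, not the authors' analysis. For such intensities the mean equals the standard deviation and the fraction of values above a threshold t falls as exp(−t/mean), which is what produces a straight line in a logarithmic histogram. Names and sample sizes are illustrative.

```python
import cmath
import math
import random
import statistics


def speckle_intensities(n_samples=5000, n_scatterers=50, seed=1):
    """Intensities of sums of random unit phasors, normalised to unit mean."""
    rng = random.Random(seed)
    out = []
    for _ in range(n_samples):
        amplitude = sum(cmath.exp(2j * math.pi * rng.random()) for _ in range(n_scatterers))
        out.append(abs(amplitude) ** 2 / n_scatterers)
    return out


def fraction_above(values, threshold):
    """Empirical survival function, to compare with exp(-threshold / mean)."""
    return sum(v > threshold for v in values) / len(values)


if __name__ == "__main__":
    intensities = speckle_intensities()
    mean = statistics.fmean(intensities)
    std = statistics.pstdev(intensities)
    print(f"mean = {mean:.3f}, std = {std:.3f}")
    for t in (1.0, 2.0, 3.0):
        print(f"P(I > {t}) = {fraction_above(intensities, t):.3f} "
              f"(exponential model: {math.exp(-t / mean):.3f})")
```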

The size of the rigid structural unit that gives rise to the observed continuous 
diffraction pattern was estimated from the extent of the autocorrelation function, 
obtained by a Fourier transform of the continuous diffraction intensities. From 
equation (5), this yields the incoherent sum of autocorrelation functions of the 
structural unit in its various crystallographic orientations. To avoid the influence 
of the Bragg peaks on this calculation, this symmetrized autocorrelation function 
was generated from windowed regions of the full 3D pattern above a resolution of 
1/q = 4.5 Å, following a procedure previously described. Using a Gaussian window
function with a 15-pixel radius, 30 autocorrelation functions were generated
in different locations in an orthogonal central section of the full 3D pattern. The 
functions produced from each subregion of reciprocal space were then averaged to 
obtain a single estimate of the projected symmetrized autocorrelation in the cor- 
responding orthogonal direction (Fig. 3a). Although the windowing in reciprocal 
space reduces the spatial resolution of the determined autocorrelation map, it is 
still sufficient to identify the structural unit as an object of similar size to the PSII 
dimer. The assumption of the PSII dimer as the rigid structural unit was ultimately 
validated by the final electron density map obtained by phasing the continuous 
diffraction, and by improvements obtained in structural refinement (Extended 
Data Fig. 1), discussed below (another choice was also tested as described below). 
This result is not surprising because the dimer of PSII is the asymmetric unit 
of the crystals and is also the native biological oligomeric form, present in the 
photosynthetic membrane, maintained during the isolation procedure and in the 
process of crystallization. 
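
The windowed-autocorrelation estimate can be sketched in two dimensions. This is a toy illustration, not the published procedure: the object is a random 16 × 24 pixel patch, the window positions and the interpretation of the window radius as a Gaussian standard deviation are assumptions, and the function name `windowed_autocorrelation` is hypothetical. The point it shows is that the modulus of the Fourier transform of a windowed intensity region remains confined to the (slightly blurred) autocorrelation support of the object, so its extent reveals the object size.

```python
import numpy as np

def windowed_autocorrelation(intensity, centre, radius):
    """Modulus of the inverse FFT of a Gaussian-windowed region of a centred intensity map."""
    y, x = np.indices(intensity.shape)
    window = np.exp(-((y - centre[0]) ** 2 + (x - centre[1]) ** 2) / (2.0 * radius ** 2))
    patch = np.fft.ifftshift(intensity * window)  # move q = 0 to the array origin
    return np.abs(np.fft.fftshift(np.fft.ifft2(patch)))

n = 128
rng = np.random.default_rng(1)
obj = np.zeros((n, n))
obj[:16, :24] = rng.random((16, 24))  # toy object, 16 x 24 pixels
intensity = np.fft.fftshift(np.abs(np.fft.fft2(obj)) ** 2)

# 30 window positions on a ring away from the origin (hypothetical geometry)
angles = np.linspace(0.0, 2.0 * np.pi, 30, endpoint=False)
centres = [(n // 2 + 30.0 * np.sin(a), n // 2 + 30.0 * np.cos(a)) for a in angles]
mean_autocorr = np.mean(
    [windowed_autocorrelation(intensity, c, 8.0) for c in centres], axis=0)

# The averaged map is concentrated within about twice the object size along each axis.
dy, dx = np.indices((n, n)) - n // 2
inside = (np.abs(dy) <= 15) & (np.abs(dx) <= 23)
print(mean_autocorr[inside].mean(), mean_autocorr[~inside].mean())
```

Because each window is offset from the origin, the inverse transform carries a phase ramp; taking the modulus removes it before averaging.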

The low-resolution structure from the Bragg data was obtained by molecular 
replacement phasing. The program F2MTZ was used to obtain an MTZ-file from 
the integrated intensities produced by CrystFEL. Structure-factor amplitudes were 
generated using TRUNCATE. Molecular replacement phasing was performed
using PHASER in PHENIX with the starting model being the PSII monomer
from a published 1.9-Å-resolution structure (ref. 2; PDB, 3WU2). All ligands and
cofactors except lipids, detergents, water and cryo-protectant molecules were kept
in the input model. A composition containing the input model and its repeat was 
searched for, and PHASER found a single solution. Three rounds of simple refine- 
ment were performed using phenix.refine (with the program options rigid body, 
XYZ coordinates and occupancies). In the refinement, the 3WU2 structure was used 
as a reference model to restrain relationships between atoms to avoid overfitting. 
No further model building was carried out after this step. This process generated 
R and Rfree values of 24.8% and 27.2%, respectively. A rendering of representative
structural elements of PSII in the electron density map of the 4.5-Å-resolution
structure derived from Bragg peaks only is shown in Fig. 4 (the model rendered 
in green). 

The 4.5-Å-resolution electron density map was used to generate a binary map
of the support of the PSII dimer (Fig. 3d) to be used for iterative phasing of the 
continuous diffraction. The inverse Fourier transform of the phased Bragg struc- 
ture factors yields the unit-cell electron density map. To obtain the electron den- 
sity of a single PSII dimer from the unit cell, phenix.cut_out_density was used to 
select only the density that was at most 2 Å from any atom in the refined structural
model of the PSII dimer. This density was Fourier transformed to obtain the
squared modulus of the molecular transform up to 1/q = 4.5 Å, which was then
symmetrized by the point-group operations consisting of 180° rotations about 
each orthogonal axis and inversion through the origin (Extended Data Fig. 6). The 
similarity between this low-resolution computed continuous diffraction pattern 
and the measurements gives further confidence in the choice of the PSII dimer
as the rigid structural unit. To avoid over-constraining the phasing of the continuous
diffraction data (described below), a dilated binary support mask was
generated from the PSII dimer electron density by convolving it with a Gaussian
of width 4.4 Å (three voxels) and thresholding the blurred density. Orthogonal
slices through the support can be compared with the PSII dimer electron density 
image in Extended Data Fig. 7. 
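
The blur-and-threshold step that produces the dilated support can be sketched as follows. This is a minimal illustration under stated assumptions: the Gaussian width is taken to be a standard deviation in voxels, the threshold fraction is a hypothetical value (the text does not state the one used), and a cube stands in for the cut-out dimer density.

```python
import numpy as np

def dilated_support(density, sigma_voxels, threshold_fraction):
    """Blur a density with a Gaussian (applied in Fourier space) and threshold it to a binary mask."""
    freqs = np.meshgrid(*[np.fft.fftfreq(n) for n in density.shape], indexing="ij")
    q_squared = sum(f ** 2 for f in freqs)
    # Fourier transform of a unit-area Gaussian with standard deviation sigma_voxels
    kernel_ft = np.exp(-2.0 * (np.pi * sigma_voxels) ** 2 * q_squared)
    blurred = np.real(np.fft.ifftn(np.fft.fftn(density) * kernel_ft))
    return blurred > threshold_fraction * blurred.max()

density = np.zeros((32, 32, 32))
density[12:20, 12:20, 12:20] = 1.0  # toy stand-in for the dimer density
mask = dilated_support(density, sigma_voxels=3.0, threshold_fraction=0.1)
print(int((density > 0).sum()), int(mask.sum()))  # the mask covers more voxels than the object
```

A lower threshold gives a looser support, which constrains the phasing less strongly.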

Iterative phasing. The 3D continuous diffraction was phased using the 
difference-map algorithm. This algorithm iteratively refines the electron
density image using an update rule that is expressed in terms of so-called projection 
operators operating in a finite-dimensional vector space. This N-dimensional space 
spans all possible complex-valued electron density images in a 3D array of 
N voxels in total. A vector x in this space represents a particular electron density 
image. A projection of x onto a constraint set C, $P_C(x)$, is the nearest point to x that
satisfies the constraint C. The difference map update rule is

$$x \rightarrow x + P_D[2P_S(x) - x] - P_S(x)$$

where $P_S(x)$ is the support projection operator and $P_D(x)$ the data projection
operator.
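
As a minimal sketch of this update rule, the code below implements the two projections in their standard form (zeroing outside the support; replacing Fourier magnitudes while keeping phases) on a small 2D toy problem. The array sizes, the random object and the iteration count are assumptions for illustration, and the modifications to the data projection used in this work are not included.

```python
import numpy as np

def project_support(x, support):
    """P_S: set the density to zero outside the support."""
    return np.where(support, x, 0.0)

def project_data(x, magnitudes):
    """P_D: impose the measured Fourier magnitudes, keeping the current phases."""
    ft = np.fft.fftn(x)
    return np.fft.ifftn(magnitudes * np.exp(1j * np.angle(ft)))

def difference_map_step(x, support, magnitudes):
    """One update x -> x + P_D[2 P_S(x) - x] - P_S(x)."""
    ps = project_support(x, support)
    return x + project_data(2.0 * ps - x, magnitudes) - ps

rng = np.random.default_rng(2)
support = np.zeros((16, 16), dtype=bool)
support[4:9, 5:11] = True
truth = np.where(support, rng.random((16, 16)), 0.0).astype(complex)
magnitudes = np.abs(np.fft.fftn(truth))

x = rng.normal(size=(16, 16)) + 1j * rng.normal(size=(16, 16))  # random start
for _ in range(200):
    x = difference_map_step(x, support, magnitudes)
estimate = project_support(x, support)  # read out the density estimate from the iterate
```

A density that satisfies both constraints is a fixed point of the update, which is a convenient sanity check; convergence from a random start is not guaranteed for every problem.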

The support projection operator enforces the electron density to be zero outside 
the support volume (or low-resolution molecular envelope) of the PSII dimer 
described above. The data (or Fourier) projection operator enforces the electron 
density to be consistent with the measured Fourier magnitudes. In iterative phase 
retrieval of continuous coherent diffraction patterns, this projection operation is 
usually carried out by Fourier transforming the current iterate x and setting the 
magnitude to the measured magnitude while keeping the phases from the iterate. 
Here, this projection was modified in two ways. First, the phases at low resolution
are provided by molecular replacement phasing, so, for these voxels in reciprocal
space, the projection operator was set to simply replace the complex Fourier amplitude
by the amplitudes and phases from molecular replacement. Second, the point-group
symmetries were applied to the squared Fourier magnitudes. This point-group
symmetrization has been discussed in ref. 16. In the absence of symmetry, the
multiplicative factor applied to the Fourier amplitudes is $\sqrt{I_{\mathrm{obs}}}/|F_x|$, where $F_x$ is the
Fourier transform of the iterate and $I_{\mathrm{obs}}$ is the observed intensity. In the case of a
symmetrized intensity, a similar factor is used, except that $|F_x|$ is replaced by the
point-group symmetrized version of itself.
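
The symmetrized data projection can be sketched for point group 222. The code below is an illustration under assumptions, not the authors' implementation: the symmetrized intensity is taken as the average (rather than the sum) over the four operations, the observed intensity is assumed to be on the same scale, and the low-resolution replacement by molecular-replacement phases is omitted. The helper names are hypothetical.

```python
import numpy as np

def negate_axes(a, axes):
    """Map a[q] -> a[-q] along the given axes (FFT index convention, q = 0 at index 0)."""
    for ax in axes:
        a = np.roll(np.flip(a, axis=ax), 1, axis=ax)
    return a

def symmetrize_222(intensity):
    """Average an intensity volume over the four operations of point group 222."""
    ops = [(), (1, 2), (0, 2), (0, 1)]  # identity and 180-degree rotations about each axis
    return sum(negate_axes(intensity, op) for op in ops) / len(ops)

def project_symmetrized_data(x, observed_intensity, eps=1e-30):
    """Rescale Fourier amplitudes so that their symmetrized intensity matches the data."""
    ft = np.fft.fftn(x)
    model = symmetrize_222(np.abs(ft) ** 2)
    return np.fft.ifftn(ft * np.sqrt(observed_intensity / np.maximum(model, eps)))

rng = np.random.default_rng(3)
shape = (8, 8, 8)
truth = rng.normal(size=shape) + 1j * rng.normal(size=shape)
observed = symmetrize_222(np.abs(np.fft.fftn(truth)) ** 2)
x = rng.normal(size=shape) + 1j * rng.normal(size=shape)
projected = project_symmetrized_data(x, observed)
```

After the projection, the symmetrized intensity of the iterate equals the observed intensity, while the relative weights of the four orientations at each voxel are left unchanged.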

Before applying the phasing algorithm, the molecular transform calculated from 
the phased Bragg intensities was scaled in magnitude and interpolated to match the 
merged continuous diffraction data. First, the low-resolution electron density grid 
was padded with zeros by an amount to approximately match the reciprocal-space 
voxel size to that of the continuous data. The zero-padded volume was then Fourier 
transformed and stretched using linear interpolation to make the voxel sizes match 
exactly. The relative scaling of the magnitude of continuous diffraction to Bragg 
diffraction depends on the lattice disorder length and detector pixel size. This scaling
was determined empirically at an intermediate q range (corresponding to the
resolution range 5.23–4.58 Å) in which both data types were non-zero and could
be compared. The scale factor was chosen to minimize the sum of the squares of 
the intensity differences of the two data sets over the voxels in the resolution range. 
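
For a single multiplicative factor, the least-squares scale has a closed form, k = Σ(ab)/Σ(a²). The sketch below applies it on a toy grid; the grid size, the 1.5-Å voxel size and the choice of which data set is rescaled are assumptions made for illustration.

```python
import numpy as np

def least_squares_scale(reference, target, mask):
    """Scale k that minimizes sum((k * reference - target)**2) over the masked voxels."""
    r, t = reference[mask], target[mask]
    return float(np.sum(r * t) / np.sum(r * r))

# Toy volumes on a hypothetical grid (32^3 voxels, 1.5 Å real-space voxel size)
n, voxel = 32, 1.5
f = np.fft.fftfreq(n, d=voxel)                    # q along one axis, in Å^-1
qx, qy, qz = np.meshgrid(f, f, f, indexing="ij")
q = np.sqrt(qx ** 2 + qy ** 2 + qz ** 2)
overlap = (q >= 1.0 / 5.23) & (q <= 1.0 / 4.58)   # shell in which both data types exist

rng = np.random.default_rng(6)
bragg_transform = rng.random((n, n, n))           # stand-in for the Bragg-derived intensities
continuous = 3.7 * bragg_transform                # stand-in for the continuous data
scale = least_squares_scale(bragg_transform, continuous, overlap)
```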

Starting from a random white-noise guess for the voxel values of the electron 
density, the difference-map algorithm converged within 100 iterations, retrieving 
over 30 million phases. The convergence was achieved in far fewer iterations 
than is usual in 3D coherent diffractive imaging problems of similar size, probably
owing to the initial well-estimated support and foreknowledge of the low-resolution
phases. The final 3D electron density image of the PSII dimer was
calculated by averaging the complex amplitudes over another 100 iterations. Any 
variations in retrieved phases at a particular voxel will reduce the sum in that 
voxel as compared with the sum of the moduli. The ratio of these sums, known 
as the phase retrieval transfer function, thus gives a measure of the consistency
of the retrieved phases, and the complex sum naturally truncates the data where 
the phases are not known. In addition, this process was repeated with multiple 
random initial guesses. The solutions obtained were compared using a Fourier 
shell correlation (FSC) metric to confirm that there was no bias due to the choice of 
initial guess. Resolution-dependent plots of these metrics are shown in Fig. 3e, and 
the correlation between the continuous diffraction pattern and the symmetrized 
Fourier transform of the averaged iterate is shown in Extended Data Fig. 8a. 
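
Both consistency metrics are simple to compute. The sketch below gives one possible implementation; the shell binning (equal-width shells, with array corners clamped into the outermost shell) is an assumption, and the function names are hypothetical.

```python
import numpy as np

def prtf(fourier_iterates):
    """Ratio |sum_k F_k| / sum_k |F_k| per voxel, over a list of complex Fourier iterates."""
    stack = np.asarray(fourier_iterates)
    modulus_sum = np.abs(stack).sum(axis=0)
    complex_sum = np.abs(stack.sum(axis=0))
    return np.divide(complex_sum, modulus_sum,
                     out=np.zeros_like(modulus_sum), where=modulus_sum > 0)

def fourier_shell_correlation(vol_a, vol_b, n_shells=8):
    """Normalized cross-correlation of two volumes in shells of constant |q|."""
    fa, fb = np.fft.fftn(vol_a), np.fft.fftn(vol_b)
    freqs = np.meshgrid(*[np.fft.fftfreq(n) for n in vol_a.shape], indexing="ij")
    radius = np.sqrt(sum(f ** 2 for f in freqs))
    # Shell index; frequencies beyond 0.5 cycles per voxel go into the last shell.
    shell = np.minimum((radius * 2.0 * n_shells).astype(int), n_shells - 1).ravel()
    cross = np.bincount(shell, weights=(fa * np.conj(fb)).real.ravel(), minlength=n_shells)
    power_a = np.bincount(shell, weights=(np.abs(fa) ** 2).ravel(), minlength=n_shells)
    power_b = np.bincount(shell, weights=(np.abs(fb) ** 2).ravel(), minlength=n_shells)
    return cross / np.sqrt(power_a * power_b)

rng = np.random.default_rng(4)
vol = rng.normal(size=(16, 16, 16))
ft = np.fft.fftn(vol)
print(prtf([ft, ft, ft]).min(), fourier_shell_correlation(vol, vol).min())
```

Identical iterates give a transfer function of one everywhere, whereas iterates with inconsistent phases drive it towards zero, which is how the complex average suppresses unreliable voxels.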

Model refinement. To interpret the 3.5-Å-resolution electron density volume
image, and to compare with the lower-resolution refinement of the Bragg data, we 
refined a structural model to best match the diffractive image. Although it would 
make sense to refine a molecular model to the electron density map in real space, 
refinement was done in reciprocal space following a protocol recently established 
for interpreting cryo-electron microscopy maps using conventional crystallographic
software. The 3D electron density image (and hence the complex-valued
Fourier amplitudes) was taken to be a crystal of P1 symmetry consisting of one
PSII dimer in a lattice much larger than that of the real crystal. The phase probabilities,
written in the form of Hendrickson–Lattman (HL) coefficients, were used
for reciprocal-space refinement, which was performed against an MLHL target 
(maximum likelihood with experimental phase probability distribution) using 
PHENIX. Refinement statistics are shown in Extended Data Table 1. 

Control analyses and comparisons. Two control analyses were performed to test 
whether the phasing of the continuous diffraction was an artefact of the input 
model. In the first test, the continuous diffraction was replaced by its orientational 
average, which is constant for a given q magnitude. In this case, any improvement 
in the quality of the reconstruction would be purely an artefact of the support con- 
straint. We found that different random starts led to uncorrelated reconstructions, 
as shown by the FSC plot in Extended Data Fig. 8a. In the second test, the rigid 
structural unit was assumed to be the PSII monomer instead of the PSII dimer. 
Not only is this physically unexpected, owing to the stronger binding that forms 
the PSII dimer as compared with crystal contacts, but the symmetrized molecular 
transform of a monomer exhibits a larger speckle size and smaller extent of the 
autocorrelation than those observed (data not shown). 
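
The first control replaces the measured pattern by its orientational average. A minimal sketch of such an average is given below; the one-voxel shell width and the placement of q = 0 at the array centre are assumptions.

```python
import numpy as np

def orientational_average(intensity):
    """Replace every voxel by the mean over its shell of constant |q| (q = 0 at the array centre)."""
    grids = np.indices(intensity.shape)
    centre = [n // 2 for n in intensity.shape]
    radius = np.sqrt(sum((g - c) ** 2 for g, c in zip(grids, centre)))
    shell = np.rint(radius).astype(int)  # one-voxel-wide shells
    sums = np.bincount(shell.ravel(), weights=intensity.ravel())
    counts = np.bincount(shell.ravel())
    return (sums / np.maximum(counts, 1))[shell]

rng = np.random.default_rng(5)
pattern = rng.random((16, 16, 16))
averaged = orientational_average(pattern)
```

The averaged volume keeps the radial fall-off of the data but carries no speckle structure, so any apparent reconstruction from it can only come from the support constraint.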

A comparison of the electron density maps of the molecular structure is given
in Fig. 4 and Extended Data Fig. 9, in which improvements can be seen by including
the continuous diffraction. In Extended Data Fig. 9, as well as showing the
electron density obtained by iterative phasing of the continuous diffraction data,
we show the calculated model density obtained by pseudo-crystal refinement. The
correlation of the electron density maps is given in Extended Data Fig. 9g, including
a comparison with a previous high-resolution structure of ref. 2 truncated to
a resolution of 3.5 Å.

Constraint ratio. The possibility to phase the diffraction pattern ab initio, such as 
by iterative methods, requires that the information content in the measurement 
exceeds that required to synthesize the electron density image. Such is not the case 
when measuring at Bragg peaks alone. The feasibility of phase retrieval in terms
of a quantity $\Omega$, which represents the ratio of independent measurements to the
number of unknown coefficients required to describe the electron density of an
object at a given resolution, is discussed in ref. 16. At a resolution of q = 1/d, the
object can be represented in real space by samples spaced by distances of d/2. The
number of unknown coefficients to describe the object is equal to the volume that
the object occupies divided by the voxel volume $d^3/8$. The 3D region occupied by
the object is known as the support of the object, S, with a volume given by V(S). The
number of independent Fourier-space measurements is likewise determined by
the volume of the support of the autocorrelation function of the object, because the
autocorrelation function is equivalent (through a Fourier transform) to the measured
diffraction intensities. The autocorrelation function is Hermitian, because
it is the Fourier transform of the real-valued intensities, and thus the number of
independent coefficients is equal to half the volume of the support of the autocorrelation
function, $V(A_S)$, divided by the same voxel volume $d^3/8$. Thus

$$\Omega = \frac{V(A_S)}{2V(S)}$$

In the absence of additional information, phase retrieval is feasible when $\Omega > 1$.
As an example, consider an object that is shaped as a cuboid. The autocorrelation
function is twice the length in each dimension, giving $V(A_S) = 8V(S)$ and $\Omega = 4$.
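
The cuboid example can be checked numerically by counting voxels. The sketch below computes the autocorrelation support of a binary support with FFTs; the grid and cuboid dimensions are arbitrary choices. On a discrete grid an object n samples long has an autocorrelation 2n − 1 samples long, so the computed ratio lies slightly below the continuum value of 4 and approaches it as the sampling is refined.

```python
import numpy as np

def autocorrelation_support(support):
    """Binary support of the autocorrelation of a binary support (zero lag at index 0)."""
    ft = np.fft.fftn(support.astype(float))
    overlap = np.real(np.fft.ifftn(np.abs(ft) ** 2))  # number of overlapping voxels per lag
    return overlap > 0.5

def constraint_ratio(support):
    """Omega = V(A_S) / (2 V(S)), with volumes measured in voxels."""
    return autocorrelation_support(support).sum() / (2.0 * support.sum())

# The array must be at least twice the object size along each axis to avoid wrap-around.
cuboid = np.zeros((64, 64, 64), dtype=bool)
cuboid[:20, :24, :28] = True
omega = constraint_ratio(cuboid)
print(omega)
```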

The continuous diffraction produced by a disordered crystal is given by the
incoherent sum of the transforms $|F(R_m q)|^2$ for the orientations m, as described
in equation (5). In this case, the number of independent measurements is equal to
the volume of the union of the autocorrelation supports, $V(\bigcup_m R_m A_S)$, divided
by the number of symmetry operations of the crystal point group. In the case of
PSII, there are four molecular orientations. The support of the PSII dimer S is
shown in Fig. 3d and one slice through the union of the autocorrelation supports
$\bigcup_m R_m A_S$ is given in Fig. 3b. The volume of this symmetrized support is 16.6 times
larger than V(S), yielding $\Omega = 2.1$ when considering the four orientations and
Hermitian symmetry. This value is lower than the case of only one orientation of
the dimer, for which we find that the volume of the autocorrelation of the single
support S of Fig. 3d is 10.9 times larger than V(S), and so $\Omega = 5.4$. The effect of the
symmetry is therefore not just a matter of counting symmetry operations, but
depends on the shape of the autocorrelation function compared with the shape of
the symmetrized autocorrelation. We see that even though the symmetry reduces
the ratio $\Omega$, it is still greater than unity, indicating the feasibility of phase retrieval
from the continuous diffraction data alone.
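
The dependence on shape, rather than on the number of operations alone, can be seen in a toy calculation. The sketch below uses a hypothetical diagonal rod as the support (not the PSII envelope) and compares the single-orientation ratio with the ratio obtained from the union of the four 222-related autocorrelation supports. For the values quoted in the text, the same formula gives 16.6/(2 × 4) ≈ 2.1.

```python
import numpy as np

def autocorrelation_support(support):
    """Binary support of the autocorrelation (zero lag at index 0)."""
    ft = np.fft.fftn(support.astype(float))
    return np.real(np.fft.ifftn(np.abs(ft) ** 2)) > 0.5

def negate_axes(a, axes):
    """Map a[r] -> a[-r] along the given axes (zero lag stays at index 0)."""
    for ax in axes:
        a = np.roll(np.flip(a, axis=ax), 1, axis=ax)
    return a

def constraint_ratios(support):
    """Return (Omega for one orientation, Omega for the 222-symmetrized case)."""
    single = autocorrelation_support(support)
    ops = [(), (1, 2), (0, 2), (0, 1)]  # identity and 180-degree rotations about each axis
    union = np.zeros_like(single)
    for op in ops:
        union |= negate_axes(single, op)
    v_s = support.sum()
    return single.sum() / (2.0 * v_s), union.sum() / (2.0 * len(ops) * v_s)

# Hypothetical elongated object lying along a diagonal of the x-y plane
rod = np.zeros((48, 48, 48), dtype=bool)
for i in range(16):
    rod[i:i + 4, i:i + 4, 0:6] = True
omega_single, omega_symmetrized = constraint_ratios(rod)
print(omega_single, omega_symmetrized)
```

Because the rotated autocorrelation supports only partly overlap, the symmetrized ratio falls between one quarter of the single-orientation value and the single-orientation value itself.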

Compared with the continuous diffraction pattern of a single object, Bragg data
contain fewer independent measurements, reducing the ratio $\Omega$. In a crystal, one
certainly knows that the object is contained in a volume equal to that of the unit
cell, $V_{\mathrm{cell}}$. However, the autocorrelation function (known in this case as the
Patterson function) is periodic with a unit-cell volume that is equal to that of the
crystal, $V_{\mathrm{cell}}$. Thus, the Hermitian Patterson function contains half as many
independent coefficients as the crystal unit cell itself, yielding $\Omega = 1/2$. This
is Sayre’s result, and is eight times lower than for the single (non-crystalline)
cuboid object mentioned above. Unlike the case of incoherent addition of the
continuous diffraction of rigid units in the various crystallographic orientations,
however, the constraint ratio for Bragg data does not depend on the space group.
Although the unit-cell volume becomes larger to accommodate more molecules
in a crystal of higher symmetry, this does not lead to a change in the information
content even though it gives a higher density of Bragg peaks. For the space group
P2₁2₁2₁ of PSII, there are four asymmetric units per unit cell. Therefore, the number
of unknowns is reduced by a factor of four. The number of independent measurements
is reduced owing to the symmetry of the Patterson function. In this case,
the symmetry of the Patterson function is Pmmm, which results in a factor of four
reduction compared with P1, and leads once more to $\Omega = 1/2$.

Non-crystallographic symmetry does help for Bragg data, as is well known for 
the method of molecular replacement, because it reduces the number of unknown 
coefficients needed to describe the object without necessarily reducing the number 
of independent measurements. A better knowledge of the support of the object also 
helps in this case, allowing the application of solvent flattening (equivalent to applying
the support constraint in iterative phasing). We can calculate the constraint ratio
$\Omega$ for the PSII Bragg data, given the support S of Fig. 3d. In this case, the number of
independent measurements is proportional to the volume of the unit cell, divided
by four (the number of identical copies) and divided by two. Compared with the
number of unknown coefficients, which is proportional to V(S), we calculate

$$\Omega = \frac{V_{\mathrm{cell}}}{8V(S)}$$

where the unit-cell volume is determined from the lattice parameters provided in
Extended Data Table 1. This value suggests that the phase retrieval problem for
this crystal is not solvable with only the Bragg data.
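
The expression can be evaluated once the support volume is known. In the sketch below the unit-cell volume comes from the lattice parameters in Extended Data Table 1, whereas the support volume is a purely hypothetical placeholder (20% of the cell), because V(S) is not stated in the text; the resulting number is therefore illustrative only.

```python
def unit_cell_volume_orthorhombic(a, b, c):
    """Volume of an orthorhombic unit cell, in the cube of the input length unit."""
    return a * b * c

def bragg_constraint_ratio(v_cell, v_support, n_copies=4):
    """Omega = V_cell / (2 * n_copies * V(S)); n_copies = 4 gives V_cell / (8 V(S))."""
    return v_cell / (2.0 * n_copies * v_support)

v_cell = unit_cell_volume_orthorhombic(133.25, 226.26, 307.09)  # Å^3, Extended Data Table 1
v_support_example = 0.2 * v_cell  # hypothetical value, not taken from the paper
omega_example = bragg_constraint_ratio(v_cell, v_support_example)
print(v_cell, omega_example)
```

With four identical copies, the ratio exceeds unity only if the support of one copy fills less than one eighth of the unit cell.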

Code availability. The code used to perform the merging of diffraction patterns
into a 3D volume and the iterative phasing of the continuous diffraction is
available on request.

Extended Data Figure 1 | Model refinement is improved at low
resolution. A plot of the metric Rfree as a function of resolution shell q,
showing a marked improvement of the model refined against the 3.5-Å
diffractive image. The blue curve shows Rfree prior to the inclusion of the
continuous diffraction and the red curve shows Rfree afterwards. Here Rfree
is calculated using only Bragg intensities (which were excluded from the
refinement) for a resolution below 4.5 Å.

Extended Data Figure 2 | Data quality and resolution of Bragg
diffraction. Plot of the reduced Pearson correlation coefficient, CC*
(ref. 41), as an estimate of the consistency of the integrated Bragg
intensities determined from 25,585 indexed patterns. A value of CC* = 0.5
is reached at q = 0.23 Å⁻¹, or a resolution of 4.3 Å.

Extended Data Figure 3 | Strongest continuous diffraction occurs
with strongest Bragg diffraction. a, Two-dimensional histogram of
patterns, sorted by the integrated counts in the continuous component of
the diffraction pattern (in a q range of 0.22–0.34 Å⁻¹) and the integrated
signal in all detected Bragg peaks, for all 25,585 indexed patterns. We
chose the 2,848 patterns with the strongest continuous diffraction signal
above 17 X-ray counts (purple line) in the q region to generate the 3D
continuous pattern shown in Fig. 2c. The featureless background due to
scattering from the solvent contributes 10 X-ray counts (blue line). b, The
measurement from the liquid jet without crystals. c, d, Two representative
patterns: one with speckle counts above the mean solvent background, but not
above the threshold (c), and one of the 2,848 strongest patterns (d), as
indicated by the positions of the yellow circles in a.

Extended Data Figure 4 | Patterson function of a disordered lattice.
A distorted lattice (left, black circles, with ideal positions in grey), with
vectors (red arrows) connecting all lattice points that have difference
vectors $a_i - a_j = (1, 1)$. On the right, the arrows from the left panel are
translated in two ways: upper right, the heads and tails are both displaced
from their ideal positions; bottom right, the tails are lined up, resulting in
the distribution of head positions forming a broader Gaussian. In the limit
of a large crystal, the resultant distribution is the autocorrelation of the
displacement distribution. This process can be repeated for all difference
vectors, leading to equation (3).

Extended Data Figure 5 | Continuous diffraction exhibits Wilson
statistics. Histogram of merged continuous intensities in a q range of
0.22–0.25 Å⁻¹. Above a background level of around one photon per pixel
per pulse, the logarithm of the histogram follows a linear trend with
negative slope, characteristic of the exponential distribution predicted by
Wilson statistics.

Extended Data Figure 6 | Central sections of the 3D full-pattern
diffraction volume. a–c, Bragg intensities in planes normal to the three
orthogonal reciprocal-space axes, $q_z$, $q_y$ and $q_x$, respectively, which were
arbitrarily chosen to be parallel to the c*, b* and a* axes of the PSII crystal.
The 4.5-Å extent of the Bragg peaks is indicated by the red circle. d–f,
Full-pattern diffraction intensities in central sections normal to the same
three orthogonal axes as in a–c, respectively, obtained from the 2,848
strongest snapshot diffraction patterns. g–i, Continuous diffraction
intensities calculated for a single PSII dimer using the model refined
from the 4.5-Å Bragg data, for the same set of orthogonal planes as in
a–c, respectively. The intensities were calculated from the incoherent
sum of the squared modulus of the 3D molecular transform of a single
(uncrystallized) PSII dimer in each of the four orientations of the
222 point group. All panels are plotted on the same scale, with the
experimental data (d–f) extending to q = 0.33 Å⁻¹ at the centre edge. The
agreement between each of d–f with the corresponding panel g–i is further
evidence that the rigid structural unit is the PSII dimer.

Extended Data Figure 7 | Real-space orthogonal slices. a–c, Slices of the
observed structure factor (Fo) electron density map, plotted as a grey scale,
of a single PSII dimer obtained from the 4.5-Å Bragg intensities following
model refinement of that data. Each slice is one pixel thick (1.5 Å) and
is normal to the z, y and x real-space axes, respectively (conjugate to
$q_z$, $q_y$ and $q_x$). d–f, Slices through the 3D real-space support constraint
used for iterative phasing. The support was generated by blurring the
4.5-Å-resolution electron density map by 2.2 Å and then thresholding
to achieve a binary mask. g–i, Slices through the 3.5-Å-resolution image
obtained by iterative phasing of the continuous diffraction, using the
support constraint illustrated in d–f and Fig. 3d.

Extended Data Figure 8 | The continuous pattern is consistent with
diffraction from a rigid object in crystallographic orientations.
a, When the continuous diffraction intensities are replaced by
intensities averaged over all orientations (that is, constant on surfaces of
constant q), iterative phasing using the support constraint of Fig. 3d fails,
as indicated by this plot of the FSC as a function of resolution for solutions
(green) obtained from two independent phasing trials. The blue curve
is the same FSC shown in Fig. 3e for comparison. The iterations never
converged, so a phase retrieval transfer function for the control could not
be generated. b, Plot of the cross correlation (CC) between the measured
diffraction and that calculated from the determined electron density of a
dimer, symmetrized by the four crystallographic orientations.


© 2016 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 9 | Electron density maps of regions of the PSII
dimer. a–f, Electron density maps based on maximum likelihood structure
factors (2mFo − DFc) obtained using the Bragg diffraction (model shaded
green, left), the Bragg and continuous diffraction (model shaded blue,
middle) and computed from pseudo-crystal refinement (model shaded
brown, right). The maps are rendered at various density levels relative
to the standard deviation of the overall density. a, b, Non-haem iron
coordinated by two His residues from D1 (chain A) and two from D2
(chain D), contoured at 1.5σ (a) and 4σ (b). Neighbouring Tyr and Lys
residues are displayed as well. c, Part of an α-helix (chain T), showing that
the side chains (for example, Arg and Phe) fit better in the electron density
when applying our new method (maps contoured at 1.5σ). d, Helices from
chains Y and Z (maps contoured at 1.25σ). The density map shows more
details and better agreement to the model when applying the analysis
using the Bragg and continuous diffraction. e, Detailed view of a section
of chain Z (maps contoured at 1.25σ). Using only the Bragg diffraction,
no electron density is visible at this level around the side chains of Trp,
Lys and Arg. Again, the model fits better into the map when using the
continuous diffraction. f, Two chlorophylls and part of the transmembrane
helix of chain C (maps contoured at 1.5σ). g, Matrix of Pearson correlation
coefficients of electron density maps obtained from the model presented
in ref. 2 (PDB, 3WU2), the model refined from Bragg diffraction and that
obtained from the continuous diffraction.

Panel g values:
              3WU2     Bragg    Continuous
3WU2          1        0.731    0.751
Bragg                  1        0.963
Continuous                      1

Extended Data Table 1 | Data collection, phasing and refinement

                              Bragg diffraction            Continuous diffraction   Pseudo-crystal refinement
Point group                   222                          222                      1
Space group                   P2₁2₁2₁                      -                        P1
a, b, c (Å)                   133.25, 226.26, 307.09       735.3, 735.3, 735.3      250.8, 250.8, 250.8
α, β, γ (°)                   90, 90, 90                   90, 90, 90               90, 90, 90
Resolution range (Å)          30–4.5 (4.62–4.5)            30–3.20‡                 30–3.5
Wilson B (Å²)                 191.6                        -                        -
Indexed / oriented patterns   25,585                       2,848                    -
Completeness (%)              99.9                         -                        -
I/σ(I)                        5.01 (1.49)                  -                        -
R-split (%)                   11.48 (46.16)                -                        -
CC*                           0.99 (0.94)                  -                        -
Maximum dose (MGy)            275                          -                        -

Refinement and phasing
Method                        ML*                          Difference map           MLHL†
No. reflections / phases      55,609                       3.5 × 10⁷                769,253
Resolution range (Å)          30.0–4.50 (4.58–4.50)        30.0–3.20‡               A: 30.0–3.50 (3.54–3.50)
                                                                                    B: 30.0–4.50 (4.55–4.50)
Rwork/Rfree (%)               24.8 / 27.2 (34.9 / 36.2)    -                        A: 31.7 / 32.4 (53.2 / 54.2)
                                                                                    B: 21.7 / 23.0 (39.6 / 40.6)
No. atoms                     50,074                       -                        50,074
No. voxels                    -                            1.26 × 10⁸               -
Bond lengths (Å)              0.005                        -                        0.012
Bond angles (°)               1.400                        -                        1.817

*ML, maximum likelihood.
†MLHL, maximum likelihood combined with experimental phase probability distribution.
‡3.20-Å resolution in the centre edge of the 3D voxel array.

Covariation of deep Southern Ocean oxygenation
and atmospheric CO₂ through the last ice age


Samuel L. Jaccard, Eric D. Galbraith, Alfredo Martínez-García & Robert F. Anderson


No single mechanism can account for the full amplitude of past
atmospheric carbon dioxide (CO₂) concentration variability over
glacial–interglacial cycles. A build-up of carbon in the deep
ocean has been shown to have occurred during the Last Glacial
Maximum. However, the mechanisms responsible for the release
of the deeply sequestered carbon to the atmosphere at deglaciation,
and the relative importance of deep ocean sequestration in
regulating millennial-timescale variations in atmospheric CO₂
concentration before the Last Glacial Maximum, have remained
unclear. Here we present sedimentary redox-sensitive trace-metal
records from the Antarctic Zone of the Southern Ocean
that provide a reconstruction of transient changes in deep ocean
oxygenation and, by inference, respired carbon storage throughout
the last glacial cycle. Our data suggest that respired carbon was
removed from the abyssal Southern Ocean during the Northern
Hemisphere cold phases of the deglaciation, when atmospheric
CO₂ concentration increased rapidly, reflecting, at least in part,
a combination of dwindling iron fertilization by dust and enhanced
deep ocean ventilation. Furthermore, our records show that the
observed covariation between atmospheric CO₂ concentration and
abyssal Southern Ocean oxygenation was maintained throughout
most of the past 80,000 years. This suggests that on millennial
timescales deep ocean circulation and iron fertilization in the
Southern Ocean played a consistent role in modifying atmospheric
CO₂ concentration.

Ice-core and surface-ocean proxy records provide a wealth of evidence
supporting a central role of the Southern Ocean in modulating
the air–sea partitioning of carbon during ice ages. The Southern
Ocean could exert a substantial control on the partial pressure of CO₂
(pCO₂), owing to its leverage on the efficiency of the global soft-tissue
pump (STP), by which the photosynthetic production, sinking and
remineralization of organic matter store dissolved inorganic carbon
(DIC) in the ocean interior. At present, vertical exchange of water
causes deeply sequestered CO₂ and nutrients to be brought rapidly to
the Southern Ocean surface, while iron (Fe) limitation of phytoplankton
prevents the exposed nutrients from being entirely fixed back to
organic matter before the water sinks again, allowing a net release of
CO₂ to the atmosphere. Either decreasing the rate of vertical exchange
or enhancing export production would raise DIC concentrations in
deep waters, lowering atmospheric CO₂. Higher DIC, in turn, enhances
deep water corrosivity, dissolving carbonate (CaCO₃) minerals and
thereby increasing ocean alkalinity, which produces a further CO₂
drawdown.

Phytoplankton productivity in the Subantarctic Zone of the Southern
Ocean surface was stimulated during glacial periods by an enhanced
supply of Fe-bearing dust, which would have reduced the leakage of
CO₂ to the atmosphere, strengthening the overall STP. Meanwhile,
radiocarbon measurements on deep-sea corals and benthic foraminifera
have been interpreted as showing weakened exchange between the
surface and the deep Southern Ocean during the Last Glacial Maximum
(LGM); if this is correct, such a weakened exchange would have complemented
the effect of dust. From the LGM to the Holocene, the combination
of dwindling dust inputs and accelerated vertical exchange in
the Southern Ocean would have released carbon from the deep sea,
contributing to the rise of atmospheric CO₂. However, as yet, very little
evidence has been reported for STP-driven CO₂ storage in the deep
Southern Ocean that extends before the LGM.

Here we report new geochemical observations of changes in the 
oxygenation of the deep Southern Ocean, chronicling STP carbon 
storage over the last 80,000 years (80 kyr). DIC storage by the STP, 
as defined here, is equal to zero in the ocean’s surface layer, even 
though surface waters can contain large amounts of residual carbon 
inherited from deep waters owing to the slow air-sea exchange of 
CO . This non-equilibrium component in surface waters is referred 
to as ‘disequilibrium DIC (ref. 10) and would be affected by sea ice, 
whereas STP DIC storage is independent of sea ice cover. Vertical 
stable carbon isotope (6°C) gradients in the Southern Ocean show 
large changes on glacial—interglacial and millennial (that is, 1-5 kyr) 
timescales'!. Although these gradients are influenced by storage of 
respiratory CO) in the deep ocean, they are also susceptible to large 
disequilibrium effects that may have decoupled 6'°C from DIC stor- 
age, since the air-sea equilibration of carbon isotopes is still an order 
of magnitude slower than for DIC itself!*. Because dissolved oxygen 
(O2) is consumed stoichiometrically during the respiration of sinking 
organic matter, while being replenished during air-sea exchange an 
order of magnitude faster than COs, it can provide a robust constraint 
on STP-induced carbon storage. 
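
The stoichiometric link between oxygen consumption and respired carbon can be illustrated with a short calculation. The sketch below is not taken from the paper: it assumes a fixed remineralization ratio of 117 mol of carbon released per 170 mol of O2 consumed (a commonly used Redfield-type value), and the function name and the input values are hypothetical.

```python
# Assumed remineralization stoichiometry (not given in the text):
# 117 mol of carbon respired per 170 mol of O2 consumed.
C_PER_O2 = 117.0 / 170.0


def respired_carbon(o2_saturation, o2_observed, c_per_o2=C_PER_O2):
    """Estimate respired DIC (umol/kg) from oxygen utilization (umol/kg)."""
    oxygen_utilization = o2_saturation - o2_observed
    if oxygen_utilization < 0:
        raise ValueError("observed O2 exceeds saturation")
    return oxygen_utilization * c_per_o2


if __name__ == "__main__":
    # Hypothetical deep-water values, for illustration only.
    print(respired_carbon(o2_saturation=350.0, o2_observed=180.0))
```

Because O2 re-equilibrates with the atmosphere much faster than CO2, the oxygen deficit in this kind of estimate is less affected by surface disequilibrium than DIC or δ13C would be.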

New measurements were made on two sediment cores raised from 
2,800 m and 3,600 m depth at sites just to the north of the Weddell Sea 
(Methods, Fig. 1), where the bulk of Antarctic Bottom Water (AABW) 
forms through shelf processes and deep convection'*. Modern observa- 
tions show the presence of man-made chlorofluorocarbon-11 (CFC-11) 
near the core site, unambiguously identifying waters recently ventilated 
at the Southern Ocean surface (Fig. 1a). Idealized simulations with a 
fully coupled ocean-atmosphere-biogeochemistry model show that 
when deep convection in the Weddell Sea is weakened, waters at the 
core locations become depleted in oxygen, even in a simulation for 
which North Atlantic Deep Water is somewhat enhanced (Extended 
Data Fig. 1). At the same time, any change in export production and 
remineralization rates, including Fe fertilization by abundant glacial 
dust mobilized from South America during ice ages, would have altered 
the sinking flux of organic matter to the deep Southern Ocean, thereby 
changing the consumption rate of oxygen. Prior reconstructions have 
shown that glacial export fluxes were higher only in the Subantarctic 
Zone of the Southern Ocean", so that the Antarctic Zone sites explored 
here would have been affected by the southward mixing of oxygen- 
depleted waters from lower latitudes. Thus, the core sites lie at a critical 
nexus of the Southern Ocean, where the impact of both deep southern 

Figure 1 | Modern oceanographic context. a, b, Present-day dissolved 
oxygen concentration” is shown on a meridional transect of the South 
Atlantic, averaged between 25° W and 10° E (a) and on a horizontal 
surface at a water depth of 2,800 m (b). Contours in a show the 
concentration of chlorofluorocarbon-11 (CFC-11), in picomoles per 
kilogram (ref. 27). Both core site locations are bathed by well oxygenated, 
recently ventilated subsurface water masses and observations suggest 
that the two sedimentary archives are sensitive recorders with which to 
monitor changes in deep ocean oxygenation. 


ventilation and Fe fertilization on the STP can be recorded by the 
sedimentary redox state. 

The fundamental control on sedimentary redox conditions is the bal- 
ance between the rate of supply of oxygen from overlying bottom waters 
and the rate of pore-water oxygen removal associated with respira- 
tion, which is regulated by the supply of organic matter. Comparing the 
sedimentary distribution of redox-sensitive trace metals with biogenic 
fluxes to the sea floor thus permits these processes to be disentangled, 
so that relative changes in oxygen concentrations at the water-sediment 
interface can be inferred. Manganese (Mn) enrichments are typically 
preserved in well oxygenated sediments, whereas dissolved uranium 
(U) typically diffuses across the water-sediment interface and subse- 
quently precipitates from pore waters under more reducing conditions 
(Methods). These mineral precipitates are referred to as authigenic 
phases. Monitoring the sedimentary distribution of these two metals 
thus provides sensitivity across the full range of oxygen concentrations 
typically encountered in the open ocean. 

At site TN057-13PC, authigenic U concentrations are highest during 
peak glacial conditions (LGM in Fig. 2c) and decrease rapidly there- 
after, coinciding with the increase in opal flux (Fig. 2e), a proxy for the 
rain of organic material to the seabed’* (Extended Data Fig. 2). Higher 
authigenic U during intervals of lower organic carbon supply to the 
sea floor can only reasonably be the result of decreased bottom-water 
oxygen concentrations!*!®, as also observed in prior low-resolution 
measurements (Extended Data Fig. 3 and Extended Data Table 1), 
indicating increased storage of remineralized carbon in the deep 
Southern Ocean during the LGM. Given that the solubility of oxygen 
in seawater is greater in cold waters, the observation that the deep 

Figure 2 | Palaeoclimatic reconstructions covering the last glacial 
termination at site TN057-13PC compared with ice core records. 
a, b, PCO2 (a; ref. 28) and atmospheric δ13C (b; ref. 4, error bars represent 
the standard deviation of replicate measurements) from Antarctic 
ice cores. c, Sedimentary authigenic U concentrations (reversed scale). 
d, Sedimentary Mn/Al. e, 230Th-normalized biogenic opal flux”. Shaded 
intervals highlight cold periods in the Southern Hemisphere. YD, Younger 
Dryas; ACR, Antarctic Cold Reversal; HS1, Heinrich Stadial 1. The stippled 
line represents the bulk continental crust (BCC) Mn/Al background value 
(0.0167; ref. 30). The error (1σ) related to the determination of authigenic 
U is smaller than the symbol size. p.p.m., parts per million; p.p.m.v., parts 
per million by volume; VPDB, Vienna Pee-Dee belemnite standard. 


Atlantic was less well oxygenated during the LGM indicates that the 
solubility effect on oxygenation was overwhelmed by larger changes in 
the accumulated utilization of oxygen in the ocean interior’. 
Subsequently, during Heinrich Stadial 1 (HS1, 17.5-14.7 kyr ago), 
authigenic U accumulation decreased (Fig. 2c) while Mn enrichments 
rose above the detrital background (Fig. 2d), indicating improved oxy- 
genation of bottom water. This is in striking contrast to the change 
observed in the deep North Atlantic, where oxygen concentrations fell 
to their lowest values during HS1, reflecting the reduced input of oxy- 
genated North Atlantic Deep Water!”. Increased oxygenation during 
HS1 must have included the effect of decreasing organic matter flux 
in the Subantarctic Zone, given the rapidly dwindling Fe supply to the 
surface ocean®, but the opposing behaviour with the North Atlantic is 
also consistent with a north-south ventilation seesaw'®. Importantly, 
a strong Mn enrichment recurred during the Younger Dryas (12.8- 
11.5 kyr ago), when the changes in Fe supply were too small to have altered 
deep water oxygenation much’ (Extended Data Fig. 4). Thus, greater 
oxygenation of deep water at this location during the Younger Dryas 
event provides evidence for an increased proportion of oxygen-rich 
AABW relative to oxygen-depleted circumpolar deep water. A north- 
ward shift of the circumpolar fronts may have contributed to this 
increased oxygenation, but if spatial gradients of dissolved oxygen were 
similar to those that exist today (Fig. 1) then more rapid formation of 
AABW must have occurred as well, to induce the inferred increase in 
oxygen. Together, the records imply two phases of STP weakening in 
the Southern Ocean: the first (HS1) included combined effects of dust 

Figure 3 | Palaeoclimatic reconstructions covering the interval MIS2 to 
MIS5a at site TN057-14PC and Ocean Drilling Program (ODP) site 
1090 (42.9° S, 8.9° E, 3,700 m) compared with ice core records. 
a, Relative abundance of diatom Eucampia antarctica (blue), a sea surface 
temperature proxy, in TN057-14PC (ref. 29) compared to the Antarctic 
ice core local temperature reconstruction (light grey)*! on the new 
AICC2012 age scale**. b, Ice-core atmospheric PCO2 reconstructions 
(dark grey) and sedimentary authigenic U concentrations at site TN057- 
14PC (black, reversed scale). c, 230Th-normalized biogenic opal flux at site 
TN057-14PC (ref. 29). d, 230Th-normalized Fe (brown) and alkenone 
(green) fluxes at ODP site 1090 (ref. 6). Triangles indicate the tiepoints 
used to generate the new version of the TN057-14PC age scale. Shaded 
intervals highlight cold intervals in the Southern Hemisphere, whereas 
unshaded intervals correspond to North Atlantic cold periods (Antarctic 
Isotope Maximum (AIM) 1-4 in the Southern Hemisphere). The error 
(1σ) related to the determination of authigenic U is smaller 
than the symbol size. SMOW, Vienna mean ocean water standard. 

supply and a north-south seesaw of ventilation, and probably reflected 
most of the net glacial-interglacial change’®, while the second (Younger 
Dryas) was dominated by a transient pulse of ventilation alone. These 
deglacial observations are consistent with the stable carbon isotopic 
composition of atmospheric CO2 (δ13Catm) (ref. 4) (Fig. 2b) in suggest- 
ing that carbon was removed from the deep Southern Ocean during 
two episodes of improved deep ocean oxygenation that occurred during 
HS1 and the Younger Dryas. 

Prior to the LGM, atmospheric CO2 concentrations rose repeatedly 
by 10-20 parts per million during periods of intense cold conditions 
in the North Atlantic, while Antarctica warmed. These intervals cor- 
respond to perturbations of the Atlantic Meridional Overturning and 
advances of sea ice in the North Atlantic, similar to HS1 and the 
Younger Dryas!’, when deep North Atlantic O2 concentrations 
plummeted'”. Our new data show that each of these intervals was also 
marked by decreasing authigenic U concentrations (Fig. 3b), concur- 
rent with a greater rain of organic detritus to the sea floor (Fig. 3c). 
Thus, similar to the millennial-timescale changes of the last glacial 
termination, the oxygenation proxies are consistent with the release of 
deeply sequestered remineralized carbon to the atmosphere during 
climate oscillations between 35 kyr and 80 kyr ago. Despite uncertain- 
ties in age models and the potential for nonlinear behaviour in authi- 
genic U accumulation, the Pearson correlation coefficient r between 
the TN057-14PC authigenic U and the ice-core CO2 is 0.63 (P = 10⁻¹¹, 
n = 89) over the period 35-80 kyr ago. We note less consistency between 
25 kyr and 35 kyr ago, when authigenic U was lower than expected from 
the PCO2 level; we were unable to find any sedimentological artefact that 
might have introduced a bias over this interval (Extended Data Fig. 5), 
and therefore suggest that at this time, during which there was a major 
expansion of the Northern Hemisphere ice sheets”°, PCO2 was lowered 
by a mechanism unrelated to the Southern Ocean’s control on the STP, 
although this result should be further tested. 
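
As a plausibility check on the significance quoted for the correlation (r = 0.63, n = 89), the two-sided P value of a Pearson coefficient can be computed from the regularized incomplete beta function. The sketch below is an illustration only, not the authors' procedure: it assumes independent samples (no correction for autocorrelation in the time series), and the function name and the Simpson-rule integration are choices made here.

```python
import math


def pearson_p_value(r, n, steps=20000):
    """Two-sided P value for Pearson r from n samples (null: no correlation).

    Uses P = I_x(a, 1/2) with x = 1 - r**2 and a = (n - 2) / 2, where I is
    the regularized incomplete beta function, evaluated with Simpson's rule.
    Accuracy degrades for very small |r|, where x approaches 1.
    """
    if n < 4:
        raise ValueError("need at least 4 samples for this integration")
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    if r == 0.0:
        return 1.0
    a, b = (n - 2) / 2.0, 0.5
    x = 1.0 - r * r

    def integrand(t):
        return t ** (a - 1.0) * (1.0 - t) ** (b - 1.0)

    h = x / steps  # steps must be even for Simpson's rule
    total = integrand(0.0) + integrand(x)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    incomplete_beta = total * h / 3.0
    complete_beta = math.exp(
        math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b))
    return incomplete_beta / complete_beta


if __name__ == "__main__":
    print(f"{pearson_p_value(0.63, 89):.1e}")
```

Under these assumptions the result falls at the 10⁻¹¹ level, in line with the order of magnitude reported for this correlation.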

Our results show that carbon storage in the deep Southern Ocean was 
generally amplified when Antarctica was cold, consistent with argu- 
ments based on the similarity of Antarctic temperature and atmos- 
pheric CO2 over glacial cycles. However, the results also highlight a 
fundamental difference between the glacial-interglacial transition, 
during which the O2 changes in the north and south deep Atlantic cor- 
relate, and the millennial-timescale changes, for which O2 changes are 
antiphased between north and south. This difference can be explained 
by a contrast in the dynamics of ocean circulation, which were domi- 
nated by the bipolar ventilation seesaw on millennial timescales”! rather 
than the steady-state adjustment to different CO2 levels, orbital forcing 
and ice sheet configuration that determined the glacial-interglacial 
change’®. Iron fertilization, caused by enhanced dust input during 
Antarctic cold phases®, would have contributed some degree of the 
observed Southern Ocean STP strengthening during both LGM and 
millennial-timescale changes. In this light, it may seem remarkable that 
changes in dust fluxes and deep ventilation were coordinated (Extended 
Data Table 2), reinforcing their synergistic impacts on the STP. 

We suggest that the coordination between ventilation and Fe sup- 
ply on millennial timescales occurred through a mechanistic link 
between the two. Model simulations have suggested that a weaken- 
ing of the Atlantic Meridional Overturning Circulation can lead to 
enhanced Southern Ocean ventilation via direct ocean circulation 
changes”! as well as by forcing a migration in the Southern westerly 
winds”. In addition, it has been shown that a southward shift in the 
precipitation-bearing Southern Westerlies could initiate the retreat of 
Patagonian glaciers*’. We suggest that, owing to this coupling of ocean 
and atmospheric circulation, millennial-timescale periods of enhanced 
deep Southern Ocean ventilation generally co-occurred with south- 
ward shifts of the Southern Westerlies, which simultaneously caused 
Patagonian glaciers to retreat, reducing both the production of fine- 
grained lithogenic material by glacial erosion” and the supply of dust 
by trapping glacial flour in proglacial lakes”*. Thus, shifts in atmos- 
pheric circulation, coupled with changes in the Atlantic Meridional 
Overturning Circulation, would have caused the intensity of Fe limita- 
tion and upwelling to co-vary on millennial timescales, while the inten- 
sity of deep convection was altered by both atmospheric and oceanic 
drivers. These coupled processes, coordinating the input flux of carbon 
to the deep sea as well as its release by deep convection and upwelling, 
can explain the remarkable power of the Southern Ocean in modifying 
the efficiency of carbon storage over millennial timescales, and thereby 
atmospheric CO2 levels. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 

Material and sediment composition. TN057-13PC (53.2°S, 5.1°E, 2,848 m) and 
TN057-14PC (52.0°S, 4.5° E, 3,648 m) were retrieved from the Atlantic sector 
of the Southern Ocean, south of the present-day position of the Antarctic Polar 
Front (Fig. 1). The cores contain diatomaceous ooze sequences with discontinuous 
carbonate-bearing intervals and occasional ice-rafted debris layers. The deglacial 
sequence in core TN057-14PC is disturbed and thus was not further considered 
(Extended Data Fig. 6). 

Age model. The TN057-13PC age model is taken, unchanged, from ref. 29. The 
stratigraphy of sediment core TN057-14PC has been updated on the basis of 
graphical correlation between the relative abundance of diatom E. antarctica”’, a 
proxy for sea-surface temperature, and EDC deuterium (δD) (ref. 31) expressed 
on the new AICC2012 age model”, assuming an in-phase relationship (Fig. 3). 
Sedimentation rates are high, typically in the range <15-60 cm kyr⁻¹, for glacial 
and peak interglacial intervals, respectively. 
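
An age model built from tiepoints can be sketched as a piecewise-linear mapping from core depth to age. The example below is an assumption-laden illustration: the text does not state how ages were interpolated between tiepoints, linear interpolation is simply the most common choice, and the tiepoint values and function names are hypothetical.

```python
from bisect import bisect_right


def depth_to_age(depth_cm, tiepoints):
    """Linearly interpolate age (kyr) at a depth (cm) between tiepoints.

    tiepoints: list of (depth_cm, age_kyr) pairs sorted by increasing depth.
    """
    depths = [d for d, _ in tiepoints]
    if not depths[0] <= depth_cm <= depths[-1]:
        raise ValueError("depth lies outside the tiepoint range")
    # Index of the tiepoint that closes the interval containing depth_cm.
    i = min(bisect_right(depths, depth_cm), len(depths) - 1)
    (d0, a0), (d1, a1) = tiepoints[i - 1], tiepoints[i]
    return a0 + (a1 - a0) * (depth_cm - d0) / (d1 - d0)


def sedimentation_rates(tiepoints):
    """Mean sedimentation rate (cm/kyr) for each interval between tiepoints."""
    return [(d1 - d0) / (a1 - a0)
            for (d0, a0), (d1, a1) in zip(tiepoints, tiepoints[1:])]


# Hypothetical tiepoints (depth in cm, age in kyr), not values from the paper.
TIEPOINTS = [(0.0, 20.0), (300.0, 40.0), (1500.0, 60.0)]

if __name__ == "__main__":
    print(depth_to_age(150.0, TIEPOINTS))
    print(sedimentation_rates(TIEPOINTS))
```

The sedimentation rate of each interval follows directly from the spacing of the tiepoints in depth and in age.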

Analytical methods. 230Th-normalized fluxes were evaluated as described in 
ref. 35. Concentrations of U and Th isotopes were determined by isotope dilution 
inductively coupled plasma mass spectrometry (ICP-MS)*°. The authigenic U 
content of each sample was evaluated from the measured 238U and 232Th con- 
centrations using an activity ratio of 0.5 for the detritic endmember*”. Measured 
230Th concentrations were corrected for the detrital contribution using 232Th and 
for ingrowth produced by authigenic U, and then decay-corrected to the time 
of deposition to derive 230Th-normalized fluxes. Sedimentary aluminium (Al), 
manganese (Mn) and barium (Ba) concentrations were measured by inductively 
coupled plasma optical emission spectroscopy (ICP-OES; Varian Vista Pro) by ALS 
Minerals, Canada. Precision was better than 2% for replicate measurements. The 
biogenic fraction of barium, a useful proxy for integrated export production*, 
was determined using a detritic Ba/Al ratio of 0.0067 (ref. 30). The calculations 
of normative Ba/Al and Mn/Al (the excess Ba and Mn fractions relative to the 
detrital background, respectively) are based on the assumption that the com- 
position of the Al-bearing phases of the terrigenous material remained constant 
in space and time. CaCO3 was quantified by coulometric CO2 determinations 
assuming no other carbonate-bearing phase was present. Precisions are ±3% of 
the reported values. 
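
The detrital corrections described above amount to subtracting a lithogenic component scaled from a reference element. The following sketch shows one way to write them, using the detrital ratios quoted in the text (238U/232Th activity ratio of 0.5, Ba/Al of 0.0067). The half-lives and molar masses are standard values that do not appear in the text, and the function names and sample concentrations are hypothetical.

```python
# Half-lives (years) and molar masses (g/mol) used to convert an activity
# ratio into a mass ratio; standard values, not quoted in the text.
HALF_LIFE_U238, MASS_U238 = 4.468e9, 238.05
HALF_LIFE_TH232, MASS_TH232 = 1.405e10, 232.04

DETRITAL_U_TH_ACTIVITY = 0.5  # detrital 238U/232Th activity ratio (text)
DETRITAL_BA_AL = 0.0067       # detrital Ba/Al mass ratio (text)


def detrital_u_th_mass_ratio(activity_ratio=DETRITAL_U_TH_ACTIVITY):
    """Convert a 238U/232Th activity ratio into a U/Th mass ratio."""
    # Specific activity is proportional to 1 / (half-life * molar mass).
    specific_activity_th_over_u = (HALF_LIFE_U238 * MASS_U238) / (
        HALF_LIFE_TH232 * MASS_TH232)
    return activity_ratio * specific_activity_th_over_u


def authigenic_u(u_total_ppm, th_ppm):
    """Authigenic U (p.p.m.): total U minus the detrital share tied to Th."""
    return u_total_ppm - detrital_u_th_mass_ratio() * th_ppm


def biogenic_ba(ba_total_ppm, al_ppm):
    """Biogenic Ba (p.p.m.): total Ba minus the detrital share tied to Al."""
    return ba_total_ppm - DETRITAL_BA_AL * al_ppm


if __name__ == "__main__":
    # Hypothetical sample concentrations, for illustration only.
    print(authigenic_u(u_total_ppm=5.0, th_ppm=4.0))
    print(biogenic_ba(ba_total_ppm=1000.0, al_ppm=20000.0))
```

Both corrections rest on the same assumption stated in the text: that the composition of the detrital endmember stayed constant in space and time.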

Redox-sensitive metals. The sedimentary distribution of redox-sensitive trace 
metals can be influenced by a number of factors, including (1) the rate of organic 
matter delivery to the sea floor, (2) changes in sediment accumulation rate, and 
(3) changes in bottom-water oxygenation. Comparing the abundance of multi- 
ple trace metals (in our case, Mn and U) in sediments with biogenic flux to the 
sea floor reconstructed using 230Th normalization permits these processes to be 
distinguished. It follows that relative change of oxygen concentrations at the 
water-sediment interface can be inferred. 

The sedimentary geochemistry of Mn is dominated by the redox control of its 
speciation, with higher oxidation states (Mn(III) and Mn(IV)) occurring as rela- 
tively insoluble oxyhydroxides in well oxygenated environments and the lower oxi- 
dation state (Mn(II)) being much more soluble in oxygen-depleted settings”. Mn 
enrichments in the form of Mn-oxide coatings are observed in deep sea sediments 
where oxic conditions prevail to greater sediment depths as a result of low organic 
matter respiration rates and well ventilated bottom waters. In sediments that are 
deposited under reducing conditions, there is no accumulation of Mn oxides and 
the Mn concentrations of buried deposits are low and entirely controlled by their 
insoluble detritic fraction. The presence of Mn concentrations in excess of what can 
be expected from the detritic fraction thus suggests that the host sediment must 
have accumulated under oxygen-replete conditions. 

Uranium behaves conservatively in oxygenated seawater and is present as U(VI). 
One of the primary removal mechanisms for U from seawater is via diffusion 
across the water-sediment interface of reducing sediments”. It appears that for 
sedimentary redox conditions near those for conversion of Fe(III) to Fe(II) (that 
is, suboxic conditions), soluble U(VI) is reduced to insoluble U(IV) (ref. 41). This 
lowers the pore-water U concentration, and dissolved U from the overlying bottom 
water can diffuse into the sediment along the declining concentration gradient, 
leading to an enrichment of authigenic (or excess) U in the sediment. 

The emplacement of both Mn and U authigenic mineral phases is uncorrelated 
with 230Th-normalized opal fluxes (Figs 2 and 3, Extended Data Table 2). As a result, 
we infer that the variability in organic carbon delivery to the sea floor, and its 
subsequent respiration below the sediment-water interface, was not the primary 
factor controlling the sedimentary redox conditions. 

The sedimentation rates inferred for both sediment cores are high enough to 
minimize filtering by bioturbation and to prevent post-depositional remobiliza- 
tion of U upon deepening of the sedimentary oxicline. Manganese, on the other 
hand, is frequently remobilized to the sedimentary pore fluids under reducing 
conditions. Dissolved Mn can thus migrate in the sedimentary column and 
(re)precipitate when oxic conditions are encountered®. As such, large Mn enrichments 
primarily reflect a shift from oxygen-depleted to oxygen-replete conditions. 
The sediment accumulation rate increased across the last glacial termination, 
potentially affecting trace-metal concentrations. However, both authigenic U and 
Mn/Al downcore records show opposite behaviours across the deglaciation, sug- 
gesting that sedimentary dilution was not the primary controlling factor driving 
the trace-metal distribution (Extended Data Fig. 7). Furthermore, the comparison 
between sedimentation rates and authigenic U concentrations in core TN057- 
14PC (Extended Data Fig. 5) clearly shows that the downcore authigenic U pattern 
was not dominated by changes in sediment accumulation. We thus posit that the 
observed authigenic enrichments of both U and Mn were sensitive to changing 
oxygen levels at the sediment-water interface. 

Model simulations. The model simulations were carried out with the Geophysical 
Fluid Dynamics Laboratory (GFDL) coupled ocean-ice-atmosphere model 
CM2Mc, with the embedded biogeochemical model BLING”. Well equilibrated 
states of four simulations were used for the plots shown in Extended Data Fig. 1. 
Extended Data Fig. 1 shows the difference between two simulations with atmos- 
pheric CO2 concentrations of 220 parts per million and LGM ice sheet topography, 
but different obliquity (axial tilt). Strong convection and AABW production occur 
under low obliquity (22°), while moderate convection occurs under high obliquity 
(24.5°). All other boundary conditions and parameters are identical in the two 
model runs. The plots were made from 100-year averages over the last century of 
3,800-year simulations. 

Extended Data Figure 1 | Idealized model experiments illustrating 
the impact of AABW production on dissolved oxygen relative to the 
core locations. Shaded contours show the difference in dissolved oxygen 
(ΔO2) averaged between 25° W and 10° E, for a coupled model simulation 
with strong Weddell convection compared to a simulation with moderate 
Weddell convection (Methods). Squares indicate the location of sediment 
cores TN057-13PC and TN057-14PC. 

Extended Data Figure 2 | Biogenic particle flux reconstructed by 230Th 
normalization for four independent proxies covering the last glacial 
termination at site TN057-13PC. a, 230Th-normalized total organic 
carbon flux”. b, 230Th-normalized CaCO3 flux. c, 230Th-normalized 
biogenic barium (bioBa) flux. d, 230Th-normalized biogenic opal flux”’. 
CaCO3 and bioBa data are from this study. The accumulation of biogenic 
CaCO3 above glacial background values during HS1 and the YD is 
consistent with enhanced ventilation of bottom waters during these 
intervals. Enhanced ventilation of bottom waters would have lowered the 
regenerated DIC concentration of the bottom water by releasing excess 
CO2 to the atmosphere, raising the [CO3²⁻] (ref. 44) and calcite saturation 
state of the bottom water? and thus reducing CaCO3 dissolution. 

Extended Data Figure 3 | Qualitative changes in oxygenation between 
the LGM and the Holocene. Red/black dots indicate the location of 
sedimentary records for which authigenic U concentrations/mass 
accumulation rates were higher/lower during the LGM when compared to 
the Holocene, respectively. White dots highlight cores where authigenic 
U concentrations did not change much between these two intervals 
(see Extended Data Table 1 for details). Shadings show the modern bottom 
water dissolved oxygen concentrations”®. 

Extended Data Figure 4 | Palaeoclimatic reconstructions covering the last glacial termination at site TN057-13PC compared with ice core 
records. a, Atmospheric PCO2 (ref. 28). b, EPICA Dome C (EDC) dust flux”. c, Sedimentary authigenic U (aU) concentrations. d, Sedimentary Mn/Al. 
e, 230Th-normalized biogenic opal flux, f(opal)’. 

Extended Data Figure 5 | Comparison of bulk sediment accumulation rates and authigenic U concentrations in sediment core TN057-14PC for the 
interval 20-80 kyr ago. a, Bulk sediment accumulation rates (SedRate). b, Authigenic U concentrations. 

Extended Data Figure 6 | Comparison of the deglacial sequences at 
sites TN057-14PC and TN057-13PC. a, δ18Opachy (where ‘pachy’ refers to 
the planktonic foraminifera Neogloboquadrina pachyderma) (ref. 46) and 
230Th-normalized biogenic opal flux”* in core TN057-14PC. b, δ18Opachy 
(ref. 47) and 230Th-normalized biogenic opal flux”* in core TN057-13PC. 
The grey shading highlights the disturbed portion of core TN057-14PC. 
The black triangle highlights the presence of planktonic foraminifera 
deposited during the Younger Dryas (that is, 12.18 kyr ago; ref. 46), which 
have been mixed down into late LGM sediments after the hiatus occurred. 

Extended Data Figure 7 | Cross-plot of Mn/Al and authigenic U across the last glacial termination in sediment core TN057-13PC. 

Extended Data Table 1 | Available Southern Ocean low-resolution records permitting reconstruction of the authigenic U Holocene-LGM 
gradient compiled from the literature 


Sedimentary records showing higher aU concentration/accumulation during the LGM (18-22kyr) than the Holocene (0-10kyr) 


Core Latitude Longitude Depth (m) References 

VM27-196 -36.33 -7.57 1,106 Chase et al., 01 (ref. 48) 
RC12-267 -38.68 25.45 4,144 Anderson et al., 98 (ref. 49) 
TN057-21 -41.13 7.82 4,981 Kumar et al., 95 (ref. 50) 
RC11-83 -41.60 9.72 4,718 Chase et al., 01 (ref. 48) 
VM34-157 -41.95 26.42 3,636 Anderson et al., 98 (ref. 49) 
VM22-109 -41.97 -0.25 733 Chase et al., 01 (ref. 48) 
ODP1090 -42.60 8.50 3,700 Martinez-Garcia et al., 09 (ref. 51) 
RC15-94 -42.90 -20.85 3,762 Kumar et al., 95 (ref. 50) 
VM22-108 -43.18 -3.25 4,171 Kumar et al., 95 (ref. 50) 
PS2082 -43.21 11.73 4,610 Frank et al., 00 (ref. 16) 
RC11-120 -43.52 79.87 3,193 Anderson et al., 98 (ref. 49) 
PS2498-1 -44.15 -14.49 3,783 Anderson et al., 14 (ref. 7) 
MD88-770 -46.00 96.50 3,290 Dezileau et al., 02 (ref. 52) 
MD88-769 -46.10 90.10 3,420 Rosenthal et al., 95 (ref. 53) 
RC15-93 -46.10 -13.21 2,714 Anderson et al., 98 (ref. 49) 
MD94-104 -46.50 88.10 3,460 Dezileau et al., 02 (ref. 52) 
PS1754 -46.77 7.61 2,471 Frank et al., 00 (ref. 16) 
RC13-254 -48.57 5.12 3,636 Kumar et al., 95 (ref. 50) 
PS1756 -48.73 6.71 3,787 Frank et al., 00 (ref. 16) 
MD88-772 -50.00 104.90 3,240 Bareille et al., 98 (ref. 54) 
RC11-78 -50.86 -9.86 3,115 Chase et al., 01 (ref. 48) 
MD80-304 -50.93 67.43 1,930 Rosenthal et al., 95 (ref. 53) 
RC13-271 -51.98 4.51 3,634 Kumar et al., 95 (ref. 50) 
PS1768 -52.60 4.48 3,270 Frank et al., 00 (ref. 16) 
MD88-773 -52.90 109.87 2,460 Bareille et al., 98 (ref. 54) 
RC11-77 -53.05 -16.45 4,098 Anderson et al., 98 (ref. 49) 
TN057-13PC -53.20 5.10 2,848 this study 
MD84-552 -54.40 75.83 1,780 Dezileau et al., 02 (ref. 52) 
PS1772 -55.46 1.17 4,135 Frank et al., 00 (ref. 16) 
E11-2 -56.07 -115.09 3,111 Chase et al., 03 (ref. 55) 
E20-10 -60.22 -127.05 4,474 Chase et al., 03 (ref. 55) 


Sedimentary records showing lower aU concentration/accumulation during the LGM than the Holocene 


VM24-221 -32.03 -2.82 4,204 Chase et al., 01 (ref. 48) 
RC12-294 -37.26 -10.10 3,308 Chase et al., 01 (ref. 48) 
TN057-20 -42.10 0.60 1,312 Chase et al., 01 (ref. 48) 
MD84-527 -43.83 51.33 3,269 Francois et al., 93 (ref. 56) 
RC13-259 -53.88 -4.93 2,677 Kumar et al., 95 (ref. 50) 
RC11-76 -54.38 -22.13 5,229 Anderson et al., 98 (ref. 49) 
E33-22 -54.93 -120.00 2,744 Chase et al., 03 (ref. 55) 
MD84-551 -55.10 73.30 2,230 Bareille et al., 98 (ref. 54) 
MD84-787 -56.40 145.30 3,020 Bareille et al., 98 (ref. 54) 
E27-23 -59.62 155.24 3,215 Anderson et al., 09 (ref. 29) 
KR8830 -61.00 93.20 4,300 Bareille et al., 98 (ref. 54) 
E17-9 -63.08 -135.00 4,849 Chase et al., 03 (ref. 55) 


Sedimentary records showing no significant difference in aU concentration/accumulation between the LGM and the Holocene 


VM24-240 -31.73 -28.20 4,327 Chase et al., 01 (ref. 48) 
VM24-229 -34.45 -10.60 4,202 Chase et al., 01 (ref. 48) 
VM27-197 -36.68 -6.27 4,089 Chase et al., 01 (ref. 48) 
RC13-243 -36.90 1.33 4,790 Chase et al., 01 (ref. 48) 
VM14-65 -41.00 8.00 4,825 Chase et al., 01 (ref. 48) 
MD94-102 -43.50 79.80 3,205 Bareille et al., 98 (ref. 54) 
NBP9802-6 -61.88 -169.98 3,245 Chase et al., 03 (ref. 55) 
NBP9802-5 -63.11 -169.74 2,940 Chase et al., 03 (ref. 55) 
MD88-791 -64.70 119.50 3,150 Bareille et al., 98 (ref. 54) 


Records from references 7, 16, 29 and 48-56. 

Extended Data Table 2 | Statistical correlation between sedimentary proxies for the interval 20-82 kyr ago 

| | 1090 f(Fe) | 1090 f(alk) | TN057 aU | TN057 f(opal) | pCO2 |
| 1090 f(Fe) | | | | | |
| 1090 f(alk) | .62; 0.0000 | | | | |
| TN057 aU | .52; 0.0000 | .64; 0.0000 | | | |
| TN057 f(opal) | .24; 0.0000 | .18; 0.0000 | .13; 0.0000 | | |
| pCO2 | .71; 0.0000 | .57; 0.0000 | .58; 0.0000 | .32; 0.0000 | |

230Th-normalized Fe and alkenone fluxes were measured at ODP site 1090 (ref. 6). Sedimentary U concentrations and 230Th-normalized opal fluxes? were measured at TN057-14PC; PCO2 levels were 
measured on Antarctic ice??3*. Numbers in red represent the correlation coefficient, r²; the blue numbers indicate the P values, which are less than 10⁻²⁰ in all cases. 

Biomass resilience of Neotropical secondary forests 

Land-use change occurs nowhere more rapidly than in the tropics, 
where the imbalance between deforestation and forest regrowth 
has large consequences for the global carbon cycle’. However, 
considerable uncertainty remains about the rate of biomass recovery 
in secondary forests, and how these rates are influenced by climate, 
landscape, and prior land use”*. Here we analyse aboveground 
biomass recovery during secondary succession in 45 forest sites 
and about 1,500 forest plots covering the major environmental 
gradients in the Neotropics. The studied secondary forests are 
highly productive and resilient. Aboveground biomass recovery 
after 20 years was on average 122 megagrams per hectare (Mg ha⁻¹), 
corresponding to a net carbon uptake of 3.05 Mg C ha⁻¹ yr⁻¹, 
11 times the uptake rate of old-growth forests. Aboveground 
biomass stocks took a median time of 66 years to recover to 90% of 
old-growth values. Aboveground biomass recovery after 20 years 
varied 11.3-fold (from 20 to 225 Mg ha⁻¹) across sites, and this 
recovery increased with water availability (higher local rainfall and 
lower climatic water deficit). We present a biomass recovery map of 
Latin America, which illustrates geographical and climatic variation 
in carbon sequestration potential during forest regrowth. The map 
will support policies to minimize forest loss in areas where biomass 
resilience is naturally low (such as seasonally dry forest regions) 
and promote forest regeneration and restoration in humid tropical 
lowland areas with high biomass resilience. 
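
The headline figures above are linked by simple arithmetic, which the following sketch reproduces. It assumes that carbon makes up half of dry aboveground biomass; that fraction is not stated in this passage but is the value implied by the reported numbers, and the function names are hypothetical.

```python
CARBON_FRACTION = 0.5  # assumed carbon content of dry biomass


def carbon_uptake_rate(agb_mg_per_ha, years, carbon_fraction=CARBON_FRACTION):
    """Mean net carbon uptake (Mg C per hectare per year) over a period."""
    return agb_mg_per_ha / years * carbon_fraction


def fold_variation(lowest, highest):
    """Ratio between the highest and lowest site values."""
    return highest / lowest


if __name__ == "__main__":
    # 122 Mg/ha of biomass accumulated over 20 years.
    print(carbon_uptake_rate(122.0, 20.0))
    # Range of 20-year recovery across sites: 20 to 225 Mg/ha.
    print(fold_variation(20.0, 225.0))
```

With these inputs the uptake rate works out to 3.05 Mg C per hectare per year and the across-site range to a factor of 11.25, matching the rounded values quoted in the text.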

Over half of the world’s tropical forests are not old-growth, but 
naturally regenerating forests°. We focus here on secondary forests 
that regrow after nearly complete removal of forest cover for agri- 
cultural use*. Despite their crucial role in human-modified tropical 
landscapes, the rate at which these forests will recover, and the extent 
to which they can provide equivalent levels of ecosystem services 
as the forests they replaced, remains uncertain. Several studies sug- 
gest that, once a certain threshold of disturbance has been reached, 
tropical forests may collapse® or switch to an alternative stable state’. 
However, in general, secondary tropical forests have rapid rates of 
carbon sequestration, with potentially large consequences for the 
global carbon cycle’. The magnitude of the tropical carbon sink is 
relatively well known for old-growth forest®’, but current global 
estimates on carbon uptake and storage of secondary forests are rough 

Figure 1 | Relationship between forest biomass and stand age using 
chronosequence studies in Neotropical secondary forest sites. 
a, AGB (N = 44); b, AGB recovery (N = 28). Each line represents a different 
chronosequence. The original plots on which the regression lines are based 
are indicated in grey (N= 1,364 for AGB, N=995 for AGB recovery). AGB 
recovery is defined as the AGB of the secondary forest plot compared with 
the median AGB of old-growth forest plots in the area, multiplied by 100. 
Significant relations (two-sided P < 0.05) are indicated by continuous lines; 
non-significant relationships (two-sided P > 0.05) are indicated by broken 
lines. Plots of 100 years old are also second-growth. See Extended Data Fig. 4 
for the same figure with plots colour-coded by forest type. 


or biased**? and we urgently need better-founded biomass recovery 
rates and resilience estimates. 

Secondary forests deliver a suite of ecosystem services that are 
closely linked to the biomass resilience of these forests’. We define 
biomass resilience as the ability to (1) recover quickly after distur- 
bance (that is, the absolute recovery) and (2) return to its original 
(pre-deforestation) state (that is, the relative recovery). Few studies 
have quantified forest recovery directly by monitoring secondary 
forest plots over time, and they typically cover only a few years (but 
see ref. 11). Chronosequence studies have evaluated different plots 
at different times after abandonment” and shown that recovery 
rates vary dramatically with climate’, soils'*, and management 
intensity'®. However, the meta analyses so far'*'°'8 did not con- 
sider Neotropical-wide environmental gradients and were not based 
on original data, making direct comparisons impossible because of 
variation in methodology and allometric biomass equations used. 

Figure 2 | AGB after 20 years. a, In relation to annual rainfall; b, in 
relation to CWD for Neotropical forest sites. Lines indicate predicted AGB 
at 20 years based on a multiple regression including 1/rainfall, CWD, and 
rainfall seasonality (R² = 0.59). Other variables were kept constant at the 
mean across sites (two-sided P < 0.005 for 1/rainfall; P = 0.03 for CWD). 
The third, less significant factor (rainfall seasonality) is shown in Extended 
Data Fig. 2. N = 43 sites (one site was excluded because no climatic data 
were available). 


Here we quantify rates of recovery of aboveground biomass (AGB) 
stocks in secondary forests, and assess to what extent recovery rates are 
driven by abiotic site conditions, landscape forest cover, and previous 
land use. We hypothesize that recovery rates increase with resource 
availability (high rainfall? and fertile soils) and decrease with degree 
of forest loss in the surrounding landscape matrix (implying lower 
seed availability), as well as with intensity of prior land use. We test 
these hypotheses using an unprecedented multi-site analysis includ- 
ing 45 sites, 1,468 plots, and >168,000 stems, spanning the major 
environmental and latitudinal gradients in the Neotropics, and we 
present a biomass recovery map to support policy decisions for the 
Neotropics. 

We compiled data from 45 chronosequence studies (Extended Data 
Table 1); we also included old-growth plots for 28 sites with no signs 
of recent human disturbance. In each plot all stems >5 cm diameter 
were measured and identified, their biomass estimated using allometric 
equations, and summed across all trees to obtain live plot AGB. 

For each site, AGB was modelled as a function of time elapsed since 
abandonment: that is, since the main human activity (crop production, 
cattle ranching) ceased at the site. Biomass resilience was quantified in 
two ways. For absolute recovery rate, the site-specific models (N = 44; 
one outlier excluded) were used to predict AGB at 20 years, a representative 
age for secondary forests in the Neotropics. For relative recovery, 
AGB was expressed as a percentage of the median AGB of old-growth 
forest plots in the area. AGB recovery was related to climatic water 
availability, soil cation exchange capacity (CEC), forest cover, and prior 
land use using multiple regression analysis. The best-supported equation 
was used to produce a biomass recovery map of the Neotropics. 

Recovery of AGB showed a saturating relationship with stand 
age for the individual sites. After 20 years of secondary succession, 
AGB accumulation varied 11.3-fold (from 20 to 225 Mg ha⁻¹, average 
121.8 ± 7.5 s.e.m.) across sites (Fig. 1a). Biomass recovery after 
20 years, relative to old-growth forest AGB at the same site, varied from 
25 to 85% across sites (Extended Data Fig. 1). Across sites, AGB after 
20 years increased with annual precipitation (P < 0.001), climatic water 
deficit (CWD; P < 0.001, a higher value here means a lower deficit), 
and rainfall seasonality (R² = 0.59, P < 0.0001; Fig. 2, Extended Data 
Fig. 2a and Extended Data Table 2). There was considerable variation 
at a given level of water availability, which is also reflected in the root 
mean squared error of 31.7 Mg ha⁻¹. CEC, forest cover, and prior 
land use were not significant predictors of AGB accumulation across 
sites (Extended Data Fig. 2). For relative recovery, not only rainfall 
(P = 0.040) but also CEC (P = 0.027) had a significant positive effect 
on biomass recovery (Extended Data Table 2 and Fig. 1). The biomass 
recovery map (Fig. 3, see Extended Data Fig. 3 for an uncertainty map) 
shows high variation in potential AGB accumulation rates across the 
Neotropics, with low rates in dry forests in Mexico and high rates in 
humid forests in southern Mexico, Costa Rica, and large parts of the 
Amazon. 

Figure 3 | Potential biomass recovery map of Neotropical secondary 
forests. The total potential AGB accumulation over 20 years of lowland 
secondary forest growth was calculated on the basis of a regression 
equation relating AGB with annual rainfall (AGB = 135.17 − 103,950 × 
1/rainfall + 1.522 × rainfall seasonality + 0.1148 × CWD; see Methods). 
The colour indicates the amount of forest cover recovery (red is low 
recovery; green is high recovery). The 44 study sites are indicated by 
circles (the symbol on the ocean belongs to an island) and the size of the 
symbols scales with the AGB attained after 20 years. The grey areas do 
not belong to the tropical forest biome. The map focuses on lowland 
tropical forest (altitude <1,000 m). See Extended Data Fig. 5 for a 
colour-blind-friendly map. 

We show that AGB of 20-year-old secondary forest varies more than 
11-fold across sites, with potentially large consequences for the carbon 
storage and mitigation potential of tropical forests (Supplementary 
Information S1). The carbon sequestration and mitigation potential 
of secondary forests is immense. Average biomass after 20 years 
corresponds to an annual net carbon uptake of 3.05 Mg C ha⁻¹ yr⁻¹, 
11 times the uptake rates of Amazonian old-growth forests in 2010 
(0.28 Mg C ha⁻¹ yr⁻¹ (ref. 19)) and 2.3 times the uptake rates of 
selectively logged Amazonian forests (1.33 Mg C ha⁻¹ yr⁻¹ (ref. 20)) where 
reduced impact logging techniques have been applied. Clearly, anthro- 
pogenic disturbances open up the canopy, enhance light availability, 
and set the system back to an earlier successional stage, leading to lower 
standing biomass but faster growth rates of the remaining forest stand. 
Although second-growth forests have substantially lower carbon stocks 
and biodiversity than the old-growth forests they replace, their carbon 
sequestering potential is high. Therefore, we urge halting deforestation 
and advocate actions that promote natural regeneration in deforested 
areas. Recognition of the significant carbon mitigation potential and 
other important services and values delivered by second-growth 
forests”! can be an important motivating factor to reach national targets 
for forest restoration inspired by the Convention on Biological Diversity 
Aichi Targets, the Bonn Challenge, and the New York Declaration on 
Forests. 
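
The uptake figures quoted in this paragraph follow from simple arithmetic on the mean AGB after 20 years. The short Python sketch below reproduces that arithmetic; the function and variable names are illustrative only, and the 0.5 carbon fraction is the value stated in the Methods.

```python
# Worked arithmetic for the carbon uptake figures quoted in the text.
# Names are illustrative; only the numbers come from the paper.

CARBON_FRACTION = 0.5     # carbon share of dry biomass (stated in Methods)
MEAN_AGB_20YR = 121.8     # Mg ha^-1, mean AGB across sites after 20 years
YEARS = 20

OLD_GROWTH_UPTAKE = 0.28  # Mg C ha^-1 yr^-1 (ref. 19)
LOGGED_UPTAKE = 1.33      # Mg C ha^-1 yr^-1 (ref. 20)


def annual_carbon_uptake(agb_mg_ha, years, carbon_fraction=CARBON_FRACTION):
    """Mean annual net carbon uptake (Mg C ha^-1 yr^-1) over `years`."""
    return agb_mg_ha * carbon_fraction / years


uptake = annual_carbon_uptake(MEAN_AGB_20YR, YEARS)
ratio_old_growth = uptake / OLD_GROWTH_UPTAKE
ratio_logged = uptake / LOGGED_UPTAKE

print(f"uptake: {uptake:.3f} Mg C ha^-1 yr^-1")
print(f"relative to old-growth forest: {ratio_old_growth:.1f}x")
print(f"relative to selectively logged forest: {ratio_logged:.1f}x")
```

The result is a mean rate over the first 20 years, so it should not be read as the uptake of a stand at any particular age.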

Secondary forests differ dramatically in their biomass resilience, 
driven mainly by variation in water availability (Fig. 2, cf. ref. 13), 
although other factors play a role as well. Both higher rainfall and lower 
CWD increase plant water availability and extend the growing season, 
thereby increasing biomass growth of trees and forest stands”. Climate 
change scenarios predict less and/or more variable rainfall for several 
regions in the tropics”’, which may potentially hamper biomass recov- 
ery and forest resilience in these regions. 

Soil fertility (CEC) had a significant positive effect on relative bio- 
mass recovery, but no effect on absolute recovery. Perhaps the effect 
of CEC was weak because for many sites CEC was obtained from a 
global database rather than measured in situ, because large differences 
in macroclimate may overrule more subtle differences in soil fertility, 
or because forest productivity is often limited by phosphorus” or 
nitrogen”, which we did not measure. Forest cover in the surrounding 
matrix and prior land use did not affect biomass recovery (Extended 
Data Figs 1 and 2), perhaps because these were estimated. Forest recov- 
ery rate may slow down in isolated secondary forest patches because 
of reduced plant colonization and survival, harsher environmental 
conditions, and frequent disturbances. Biomass recovery did not differ 
between abandoned pastures and agricultural fields, possibly because 
of high within-category variation in land-use intensity (such as use of 
fire and time under land use and frequency of cultivation cycles), which 
may strongly affect forest recovery'*!”. 

Dry and moist forests differ substantially in old-growth forest 
biomass, and relative recovery may therefore be a better indicator of 
forest resilience than absolute recovery rates. It takes a median time 
of 66 years to recover to 90% of old-growth values. However, several 
mid-successional sites of 40-100 years old attained higher biomass 
than old-growth forest (Fig. 1b), which has also been found by 
modelling studies”®. This is most probably because of a high abun- 
dance of old trees of long-lived pioneer species in mid-successional 
sites, which tend to be large and store large amounts of carbon?’ 
before they die. 

Several studies suggest that, once a certain threshold of disturbance 
is reached, tropical forests may collapse® or switch to an alternative 
stable state”. Our study shows that forests can be resilient, and that their 
biomass resilience strongly depends on water availability. By mapping 
potential for biomass recovery across the Neotropics, attention can be 
focused on particular areas that should be conserved or treated with 
extra caution because they are more difficult to restore (slow recovery). 
The recovery map can also be used to identify areas with high carbon 
sequestration potential for the United Nations collaborative initiative 
on Reducing Emissions from Deforestation and forest Degradation 
(REDD+) programmes, or where (assisted) natural regeneration, or 
restoration and afforestation activities, may have the highest chance 
of success**”? (high recovery). Such a spatially explicit, resource- 
based approach paves the road towards a more sustainable design and 
management of human-modified tropical landscapes. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Site description. We compiled the largest data set of chronosequence studies in 
the lowland Neotropics so far, including 1,468 plots established in 45 forest sites 
(Extended Data Table 1), containing 168,210 tree stems. The sites were located in 
eight countries, covering the full latitudinal range in the tropics varying from 20°N 
in Mexico to 22°S in Brazil. We focused on lowland sites, generally below 1,000 m 
altitude. Annual rainfall varied fivefold across sites (from 750 to 4,000 mm yr⁻¹), 
thus all major lowland forest types were included. Soil CEC varied 33-fold across 
sites (from 2 to 65 cmol(+) kg⁻¹). Forest cover in the landscape ranged from 
31 to 100% across sites. 

Site conditions. We assessed whether biomass recovery rates were related to 
abiotic site conditions (rainfall, soil fertility), and land-use intensity (prior land 
use and percentage forest cover in the surrounding matrix). For each site, mean 
annual rainfall was obtained from the nearest climatological station. Biomass 
recovery should ultimately respond to plant water availability, which is a func- 
tion of annual rainfall and its seasonal distribution, topography, soil depth, 
and the water holding capacity of the soil. Rainfall seasonality (expressed as a 
coefficient of variation of rainfall) was obtained from WorldClim*’ and CWD 
(in millimetres per year) from http://chave.ups-tlse.fr/pantropical_allometry.htm. 
CWD is the amount of water lost during dry months (defined as months 
where evapotranspiration exceeds rainfall), and is calculated as the total rain- 
fall minus evapotranspiration during dry months. This number is by defini- 
tion negative, and sites with CWD of 0 are not seasonally water stressed. CEC 
(in centimoles of positive charge per kilogram of soil) was used as an indicator 
of soil fertility of the site. Ideally, local soil data should be used, but these were 
only available for 12 sites. For the other sites, CEC data were obtained from the 
Harmonized World Soil Database (HWSD)*". For those 12 sites, the local CEC 
and HWSD data are indeed strongly positively correlated (r= 0.84, P < 0.0006, 
n= 12), indicating that the HWSD ranks the sites well in terms of CEC. When 
local soil fertility data were available, the CEC of the old-growth forest was used, 
as this indicates potential site fertility, which is also included in the world soil 
database. The HWSD database did not contain data on soil nitrogen and phos- 
phorus. Phosphorus is thought to limit plant growth in highly weathered and 
leached tropical soils and correlates well with the productivity of old-growth 
Amazonian forests”4. Nitrogen may be more limiting during the first decades of 
secondary succession because a large part of vegetation nitrogen is volatilized 
when the slashed vegetation is burned”°, while soil preparation, crop cultivation, 
and nutrient leaching may further deplete soil nutrients. Indeed, early succes- 
sional forests have relatively many nitrogen-fixing trees. To describe the land- 
scape matrix, we estimated per site, for all plots combined, the average cover in 
the following five land use classes in a 1 km radius around the plots: mature forest, 
secondary forest, plantations/agroforestry systems, agriculture/pasture, other. 
This estimate is based on the long-term field experience of the authors at the site 
where they worked. We then calculated per site the average percentage total forest 
cover (of mature and secondary forest) in a 1 km radius around the plots. We 
used total forest cover because forests improve microclimatic conditions at the 
landscape level and act as important sources of seeds and biotic agents (mammal 
and bird dispersal agents, mycorrhizae) that may help to speed up the rate of sec- 
ondary succession. We classified the prior land use of each site as cattle ranching 
or shifting cultivation (or a combination of these two), which are the two major 
land-use types where forest regeneration occurs in the Neotropics. We acknowl- 
edge that different land use practices, such as fire frequency!® and length of 
cultivation’, could affect biomass recovery rates of abandoned fields, but it is very 
difficult to obtain that information in a consistent and quantitative way across 
all sites. 
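
The CWD definition given above (rainfall minus evapotranspiration, summed over months in which evapotranspiration exceeds rainfall) can be sketched in Python as follows. This is only an illustration of the definition: the study took CWD values from the cited pantropical data set rather than computing them this way, and the function name and monthly inputs are assumptions.

```python
# Minimal sketch of the climatic water deficit (CWD) definition in the text.
# The function name and the monthly inputs are hypothetical.

def climatic_water_deficit(monthly_rain_mm, monthly_et_mm):
    """Return CWD in mm per year; the value is zero or negative.

    A month counts as dry when evapotranspiration exceeds rainfall.
    CWD is the sum of (rainfall - evapotranspiration) over dry months.
    """
    if len(monthly_rain_mm) != 12 or len(monthly_et_mm) != 12:
        raise ValueError("expected 12 monthly values for each variable")
    return sum(
        rain - et
        for rain, et in zip(monthly_rain_mm, monthly_et_mm)
        if et > rain
    )


# Hypothetical site with six wet and six dry months.
rain = [200.0] * 6 + [20.0] * 6
et = [100.0] * 12
seasonal_cwd = climatic_water_deficit(rain, et)

# Hypothetical aseasonal site: no month is dry, so CWD is 0.
aseasonal_cwd = climatic_water_deficit([250.0] * 12, [100.0] * 12)
```

A CWD of 0 therefore marks a site without seasonal water stress, and more negative values mark a stronger deficit.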

Plots. Plot size varied from 0.01 to 1 ha. Most plots were rectangular, with a 
length varying from 20 to 200 m and width ranging from 1 to 50 m. The mean 
plot area across all 1,468 plots was 947 m², and the mean plot area across all 
45 sites was 1,273 m². On average 33 (5-291) plots were established per 
chronosequence. Chronosequence sites varied in the range of stand ages across the 
included secondary forest plots (Extended Data Table 1). For 28 out of the 
45 sites, old-growth plots were included. Old-growth forests did not have signs 
of recent human disturbance, and were at least 100 years old. In most cases, the 
sampling design was the same as for secondary forest plots; however, for five 
of the sites, old-growth plots were slightly larger, for one site smaller. In each 
plot, all woody trees, shrubs, and palms >5 cm stem diameter at breast height 
(at 1.3 m height) were measured for their diameter and identified to species, 
with the exception of three sites for which only data for trees >10 cm diameter 
at breast height were available. 

Tree biomass. We first evaluated how estimates of live AGB varied on the basis 
of the application of three frequently used sets of allometric equations. We 
calculated live AGB for each tree using equations from refs 32-34 
(Supplementary Information S2). The equations in ref. 32 are based on stem 
diameter only, and the equations in refs 33, 34 are based on a combination of 
stem diameter and wood density. Palms were treated in the same way as trees*, 
which may slightly overestimate palm biomass. Biomass was summed across 
all trees to obtain live plot AGB. The three different equations from refs 32-34 
resulted in small differences in absolute plot AGB estimates for dry forests 
(average AGB after 20 years ranged from 68 to 85 Mg ha⁻¹ among the three 
equations), but yielded larger differences in AGB estimates for moist forest 
(precipitation 1,500-2,500 mm yr⁻¹, AGB after 20 years ranged from 142 to 
179 Mg ha⁻¹; Extended Data Fig. 6). For 20-year-old forest, the earlier widely 
used equations***? predicted about 21-26% more biomass than the more 
recently developed equation (Extended Data Fig. 6). These differences become 
even greater with increasing forest age, with large consequences for assessing 
the magnitude of carbon uptake and storage in tropical forests (see, for exam- 
ple, ref. 19), as moist forests contribute most to the current forested area in 
the Neotropics. For further analyses we used the equation in ref. 34 because 
it provides an allometric equation based on the largest harvested tree data set 
analysed so far (and includes and extends the equation of ref. 33), including 
many secondary forest species for trees from wet and dry forests, for trees of 
most tropical continents. It is ecologically very relevant and methodologically 
elegant, as it uses a correction factor E. This bioclimatic stress variable takes 
temperature variability, precipitation variability, and drought intensity into 
account, and therefore continuous variation in site aridity. The equation in 
ref. 34 also takes variation in wood density into account, which is an important 
source of plot AGB variation across the Neotropics*®. For one site (Providencia 
Island) for which no E value was available, we estimated E from a regression 
equation relating E to annual rainfall. 

Data about species-specific wood density (in grams per cubic centimetre) 
came from the local sites, or from the Neotropical data of a global wood den- 
sity database*”** (http://datadryad.org/handle/10255/dryad.235), henceforth 
referred to as ‘Dryad’. For all estimates of wood density, we used the data source 
(local or Dryad) that had the highest level of taxonomic resolution. In the case 
of the same level of taxonomic resolution, we used the local data source. If no 
wood density information was available at the species level, then genus- or 
family-level wood density values were used, as wood density is phylogeneti- 
cally strongly conserved". Across all sites except Surutu 1 and 2, Cayey, and 
Bolpebra, 94.0% of the trees of the secondary and old-growth plots had been 
identified to species (range 74.4-100%), 97.9% to genus (range 90.8-100%), 
and 98.9% to family (range 94.9-100%). For the other four sites, 36.0-55.6% 
of the trees had been identified to species, 61.6-86% to genus, and 61.6-86% 
to family. 
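
The source-selection rule described above (finest taxonomic level first, local data preferred when both sources resolve to the same level) can be sketched as a small lookup. The data layout, function name, and taxon names below are hypothetical and serve only to illustrate the rule.

```python
# Sketch of the wood density lookup rule described in the text.
# Data layout, function name, and taxon names are hypothetical.

LEVELS = ("species", "genus", "family")  # finest to coarsest resolution


def pick_wood_density(tree, local, dryad):
    """Return (density, level, source) for a tree, or None if nothing matches.

    tree:  dict with optional 'species', 'genus', 'family' names.
    local, dryad: dicts mapping level -> {taxon name: density in g cm^-3}.
    The finest level available wins; at equal resolution local data win.
    """
    for level in LEVELS:
        name = tree.get(level)
        if name is None:
            continue
        for source_name, source in (("local", local), ("dryad", dryad)):
            density = source.get(level, {}).get(name)
            if density is not None:
                return density, level, source_name
    return None


# Hypothetical example data.
local_data = {"species": {"Aus bus": 0.61}, "genus": {"Cus": 0.48}}
dryad_data = {
    "species": {"Aus bus": 0.58, "Cus dus": 0.52},
    "genus": {"Cus": 0.50},
    "family": {"Fusaceae": 0.66},
}

tie = pick_wood_density(
    {"species": "Aus bus", "genus": "Aus"}, local_data, dryad_data)
finer_in_dryad = pick_wood_density(
    {"species": "Cus dus", "genus": "Cus"}, local_data, dryad_data)
genus_fallback = pick_wood_density(
    {"species": "Cus eus", "genus": "Cus"}, local_data, dryad_data)
family_fallback = pick_wood_density(
    {"family": "Fusaceae"}, local_data, dryad_data)
```

Iterating over levels in the outer loop and sources in the inner loop is what encodes both parts of the rule at once.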

Plot structure. Biomass was summed across all trees in a plot to obtain live plot 
AGB and expressed on a per hectare basis (AGB, in megagrams per hectare). 
For one site (San Carlos) the original tree inventory data were not available 
anymore, so for this site we used the plot biomass that the author had estimated 
using locally derived allometric equations. For plots with a nested design, we 
calculated AGB per hectare by extrapolating the biomass of trees in smaller size 
classes that were measured in a part of the plot only to the total area measured, 
thus assuming that forest structure within the plot did not vary. For four sites, 
only trees >10 cm diameter at breast height were inventoried, and for these sites a 
correction factor was used to obtain the estimated plot AGB when trees between 
5 and 10 cm diameter at breast height would have been included. Plot AGB was 
therefore multiplied by 1.18 for dry forest and by 1.03 for moist forest (compare 
ref. 40). Some plots had extremely high AGB values (>500 Mg ha⁻¹), because of 
the occurrence of a single large tree, in combination with a small plot size. These 
outliers (2.5% of the total number of plots) were excluded from the analysis. One 
site (Marques de Comillas) had extremely high average AGB (>100 Mg ha⁻¹) 
2 years after land abandonment, because of a combination of many large rem- 
nant forest trees being left during land use, and a small plot size. This site was 
excluded from the analysis. Our analysis focused on AGB only, which should 
be a good indicator of carbon recovery over time, as below-ground biomass in 
roots scales tightly with AGB of trees*!, and soil organic matter (which includes 
necromass of dead fallen trees and roots) is rather constant with time after 
abandonment"®, 
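
The per-hectare scaling for nested plots and the correction for sites lacking stems of 5 to 10 cm diameter can be illustrated as below. This is a sketch under stated assumptions: the input layout (one biomass list and sampled area per size class) and the function names are hypothetical, while the factors 1.18 and 1.03 are those given in the text.

```python
# Sketch of plot-level AGB scaling for nested designs.
# Input layout and function names are hypothetical.

M2_PER_HA = 10_000.0
KG_PER_MG = 1_000.0

# Multipliers from the text for sites where only trees >10 cm were measured.
SMALL_STEM_CORRECTION = {"dry": 1.18, "moist": 1.03}


def plot_agb_per_ha(size_classes):
    """Return plot AGB in Mg ha^-1.

    size_classes: list of (tree_biomasses_kg, sampled_area_m2), one entry
    per size class. Each class is scaled by the area in which it was
    actually measured, which assumes homogeneous structure within the plot.
    """
    total = 0.0
    for biomasses_kg, area_m2 in size_classes:
        total += (sum(biomasses_kg) / KG_PER_MG) / (area_m2 / M2_PER_HA)
    return total


def correct_for_unmeasured_small_stems(agb_mg_ha, forest_type):
    """Apply the 5-10 cm correction for sites inventoried from 10 cm up."""
    return agb_mg_ha * SMALL_STEM_CORRECTION[forest_type]


# Hypothetical plot: large trees on 1,000 m2, small trees on a 100 m2 subplot.
example_agb = plot_agb_per_ha([
    ([500.0, 1500.0], 1000.0),  # 2 Mg on 0.1 ha  -> 20 Mg ha^-1
    ([10.0, 30.0], 100.0),      # 0.04 Mg on 0.01 ha -> 4 Mg ha^-1
])
corrected_dry = correct_for_unmeasured_small_stems(100.0, "dry")
```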

Statistical analyses. No statistical methods were used to predetermine sample 
size. To evaluate how AGB (estimated with the equation from ref. 34) recovered 
over time, we modelled AGB in each site as a function of natural-logarithm- 
transformed age. We included an intercept, as in some cases there was AGB at 
time zero because remnant trees were left during forest clearance. Biomass resil- 
ience was quantified in two complementary ways: the absolute recovery rate and 
the relative recovery rate. For absolute recovery, the site-specific models (N= 44 
sites) were used to predict AGB at 10 and 20 years, as these ages were included in 
the age range of most sites, and are representative ages for secondary forests. To 
calculate annual carbon sequestration rates, we used the average AGB attained 
after 20 years, multiplied it by 0.5 (which is the average carbon value in dry 
biomass), and divided it by 20. To evaluate how fast AGB of secondary for- 
ests recovers towards old-growth forest values, we calculated the relative AGB 
recovery as the AGB of the secondary forest plot compared with the median 
AGB of old-growth forest plots in the area (as a percentage). For each site, we 
estimated the recovery time as the time needed to recover to 90% of old-growth 
values. We used time to recover to 90%, rather than to recover 100% of old- 
growth values, as it indicates when the biomass is close to old-growth values. 
To attain the full 100% may take much more time, as we modelled biomass 
build-up to show an asymptotic relationship with time. This study aimed to 
assess the rates of recovery of tree biomass under conditions where regener- 
ation is occurring naturally. Sometimes, succession can be arrested owing to 
high-intensity land use or seed limitation. We did not target sites where suc- 
cession was arrested, so it is possible that our ‘average’ conditions could have 
overestimated rates of recovery when considering all potential areas. However, 
neither did we specifically select sites that were regenerating particularly fast. 
Given the high number of sites and plots across the Neotropics that cover the 
entire range of typical land-use types and intensities, we are confident that we 
provide realistic rates of biomass accumulation for dominant land-use types 
and intensities in the Neotropics. 
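
The site-level model described above (AGB as a linear function of ln-transformed age with an intercept) and the derived quantities can be sketched as follows. The analyses in the paper were run in R; this Python version uses ordinary least squares on hypothetical data, all names are illustrative, and the recovery-time function applies the fitted AGB curve to a given old-growth median, which may differ in detail from the authors' procedure.

```python
import math

# Sketch of the site-level recovery model: AGB = intercept + slope * ln(age).
# Function names and the example data are hypothetical.


def fit_log_age_model(ages_yr, agb_mg_ha):
    """Ordinary least-squares fit of AGB on ln(age); returns (intercept, slope)."""
    x = [math.log(age) for age in ages_yr]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(agb_mg_ha) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, agb_mg_ha))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def predict_agb(intercept, slope, age_yr):
    """Predicted AGB (Mg ha^-1) at a given stand age."""
    return intercept + slope * math.log(age_yr)


def relative_recovery(agb_mg_ha, old_growth_median_mg_ha):
    """Secondary forest AGB as a percentage of the old-growth median."""
    return 100.0 * agb_mg_ha / old_growth_median_mg_ha


def years_to_recover(intercept, slope, old_growth_median_mg_ha, fraction=0.9):
    """Stand age at which predicted AGB reaches `fraction` of old-growth AGB."""
    if slope <= 0:
        raise ValueError("AGB does not increase with age in this model")
    target = fraction * old_growth_median_mg_ha
    return math.exp((target - intercept) / slope)


# Hypothetical chronosequence generated from intercept 10 and slope 40.
ages = [2, 5, 10, 20, 40]
agb = [10.0 + 40.0 * math.log(age) for age in ages]
intercept, slope = fit_log_age_model(ages, agb)
agb_at_20 = predict_agb(intercept, slope, 20)
recovery_time = years_to_recover(intercept, slope, 200.0)
```

Because the fitted curve flattens with age, the time to reach 100% of old-growth AGB is far more sensitive to the fitted parameters than the time to reach 90%, which is consistent with the choice of threshold explained above.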

To evaluate how predicted absolute and relative recovery of AGB after 
20 years varied with abiotic site conditions and land-use intensity, we used mul- 
tiple regression analysis. We related AGB recovery to rainfall, rainfall seasonality, 
CWD, CEC, forest cover in the landscape matrix, and prior land use (pasture, 
mix of pasture and agriculture, agriculture) for 43 sites (for one island site no 
WorldClim data were available). AGB showed a saturating relationship with 
rainfall; therefore we used 1/rainfall as a predictor rather than absolute rainfall 
or the natural logarithm of rainfall, which improved model fits. We compared 
models of all possible combinations of predictors based on Akaike’s information 
criterion adjusted for small sample sizes (AICc), and selected the 
best-supported model with the lowest AICc. All analyses were performed in R 3.1.2 
(ref. 42). 
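
The all-subsets comparison by AICc can be illustrated with a small, dependency-free Python sketch. It is not the authors' R code: the least-squares solver, the function names, and the example data are assumptions, and AICc is computed from the residual sum of squares, which ranks models identically to the likelihood-based form up to a constant.

```python
import itertools
import math

# Sketch of all-subsets model selection by AICc. Names and data are hypothetical.


def ols_rss(columns, y):
    """Residual sum of squares of y regressed on an intercept plus columns."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(design[0])
    # Augmented normal equations (X'X | X'y), solved by Gaussian elimination.
    m = [
        [sum(design[i][r] * design[i][c] for i in range(n)) for c in range(p)]
        + [sum(design[i][r] * y[i] for i in range(n))]
        for r in range(p)
    ]
    for k in range(p):
        pivot = max(range(k, p), key=lambda r: abs(m[r][k]))
        m[k], m[pivot] = m[pivot], m[k]
        for r in range(k + 1, p):
            factor = m[r][k] / m[k][k]
            for j in range(k, p + 1):
                m[r][j] -= factor * m[k][j]
    beta = [0.0] * p
    for k in reversed(range(p)):
        tail = sum(m[k][j] * beta[j] for j in range(k + 1, p))
        beta[k] = (m[k][p] - tail) / m[k][k]
    fitted = [sum(design[i][j] * beta[j] for j in range(p)) for i in range(n)]
    return sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))


def aicc(rss, n, n_coefficients):
    """AICc for a least-squares model; k counts coefficients plus the variance."""
    k = n_coefficients + 1
    return n * math.log(rss / n) + 2 * k + 2 * k * (k + 1) / (n - k - 1)


def best_subset(predictors, y):
    """Return (AICc, predictor names) of the lowest-AICc subset."""
    n = len(y)
    best = None
    for size in range(len(predictors) + 1):
        for subset in itertools.combinations(sorted(predictors), size):
            rss = ols_rss([predictors[name] for name in subset], y)
            score = aicc(rss, n, len(subset) + 1)
            if best is None or score < best[0]:
                best = (score, subset)
    return best


# Hypothetical data: the response follows the predictor with small deviations.
x = [float(i) for i in range(1, 11)]
response = [2.0 * xi + (0.5 if i % 2 == 0 else -0.5) for i, xi in enumerate(x)]
best_score, best_names = best_subset({"x": x}, response)
```

With few sites the small-sample penalty term is large, so an extra predictor must reduce the residual sum of squares substantially before AICc favours it.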

On the basis of our regression analysis, we constructed a potential biomass 
recovery map of the Neotropics at a resolution of 1 km². First, a map of 
world ecoregions was obtained from the Nature Conservancy 
(http://maps.tnc.org/gis_data.html; last accessed on 26 January 2015). Eight hundred 
and sixty-seven distinct units were categorized into 14 biomes in 8 biogeo- 
graphical realms“. Of these 14 biomes, three are relevant to our results: (1) 
tropical and subtropical moist broadleaf forests, (2) tropical and subtropical 
dry broadleaf forests, and (3) tropical and subtropical coniferous forests. Our 
scaled-up study region was then defined as the full spatial extent encompass- 
ing these three biomes in Central or South America, the region in which our 
network of study sites was located. Total annual precipitation was calculated 
by summing the individual monthly totals provided by WorldClim. Data 
for mean annual rainfall (defined as the average of 1950-2000) and rainfall 
seasonality (defined as variable ‘BIO15’ in WorldClim) were obtained at a 
30 s resolution (approximately 1 km × 1 km) from WorldClim 
(http://www.worldclim.org/current), and CWD was obtained from 
http://chave.ups-tlse.fr/pantropical_allometry.htm. We then calculated the total potential AGB 
accumulation over 20 years of secondary forest growth (assuming that the 
initial year 0 condition was a fully cleared area), based on annual rainfall, 
rainfall seasonality, and CWD, because they were included in the best model 
explaining absolute recovery (Extended Data Table 2). The regression equation 
obtained in this study, relating AGB after 20 years with the climatic variables, 
was AGB at 20 yr = 135.17 − 103,950 × 1/rainfall + 1.521983 × rainfall 
seasonality + 0.1148 × CWD. Estimated AGB would then indicate the absolute 
biomass recovery potential. We stress that this is a potential biomass recovery 
map based on the relationship between recovery and rainfall for our sites; 
realized local recovery may vary because of differences in local soil conditions, 
land use history, the surrounding matrix, and availability of seed sources. We 
also made an uncertainty map of potential biomass based on the confidence 
interval of the predicted mean AGB at 20 years obtained from the multiple 
regression. We calculated the confidence interval of the predicted AGB at 
20 years for each pixel based on 1/rainfall, CWD, and rainfall seasonality. The 
relative uncertainty for each pixel was calculated as 100 × (0.5 × 95% confidence 
interval of the mean)/predicted AGB. For the maps we focused on lowland 
forest (<1,000 m altitude), between the Tropics of Cancer and Capricorn. 
To make the maps, we had to extrapolate very little beyond our climatic parameter 
space; our sites cover a rainfall range from 750 to 4,000 mm yr⁻¹. Sites with 
rainfall >4,000 mm yr⁻¹ (where we extrapolate with our equation) cover only 
0.88% of the lowland Neotropics. 
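
The per-pixel calculation behind the two maps can be sketched directly from the regression equation and the relative-uncertainty formula given above. The function names are illustrative, and the example inputs are the rainfall, seasonality, and CWD values listed for the Bahia site in Extended Data Table 1; the confidence limits in the second call are made-up numbers used only to show the formula.

```python
# Sketch of the per-pixel map calculations described in the Methods.
# Function names are hypothetical; coefficients are those reported in the text.


def predicted_agb_20yr(rainfall_mm_yr, rainfall_seasonality, cwd_mm_yr):
    """Potential AGB (Mg ha^-1) after 20 years from the reported regression."""
    return (
        135.17
        - 103950.0 * (1.0 / rainfall_mm_yr)
        + 1.521983 * rainfall_seasonality
        + 0.1148 * cwd_mm_yr
    )


def relative_uncertainty(ci_lower, ci_upper, predicted_agb):
    """Half-width of the 95% confidence interval as a percentage of the prediction."""
    return 100.0 * 0.5 * (ci_upper - ci_lower) / predicted_agb


# Climate values for Bahia as listed in Extended Data Table 1.
bahia_agb = predicted_agb_20yr(
    rainfall_mm_yr=2000.0, rainfall_seasonality=17.0, cwd_mm_yr=-7.0)

# Made-up confidence limits, only to illustrate the formula.
example_uncertainty = relative_uncertainty(90.0, 110.0, 100.0)
```

Because the uncertainty is divided by the prediction, a fixed confidence-interval width yields a larger percentage wherever predicted AGB is low, which matches the pattern of highest relative uncertainty in the driest areas.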
Extended Data Figure 1 | Relative recovery of AGB after 20 years 
in relation to abiotic factors, forest cover, and land use. a, Annual 
precipitation; b, CWD; c, rainfall seasonality; d, CEC; e, percentage 
forest cover in the surrounding matrix; f, previous land use (SC, shifting 
cultivation, N = 17; SC & PA, some plots shifting cultivation, some plots 
pasture, N = 2; PA, pasture, N = 9; means ± s.e.m. are shown). Relative 
recovery is expressed as the ratio of AGB after 20 years over median AGB of 
old-growth forest (as a percentage). Regression lines are shown while 
keeping the other variable constant at the mean value across sites (P = 0.040 
for 1/rainfall, P = 0.027 for CEC, R² = 0.23, N = 28 Neotropical forest sites). 
Extended Data Figure 2 | AGB recovery after 20 years in relation to 
abiotic factors, forest cover, and land use. a, Rainfall seasonality; b, CEC; 
c, percentage forest cover in the surrounding matrix; d, previous land use 
(SC, N = 19; SC & PA, N = 9; PA, N = 15; means ± s.e.m. are shown). 
For rainfall seasonality, the regression line is shown based upon the 
multiple regression model that also includes rainfall and CWD, and where 
these variables were kept constant at the mean value across sites (two-sided 
P = 0.003, see Fig. 2 for these models for rainfall and CWD). 
Extended Data Figure 3 | Uncertainty map of potential biomass 
recovery of Neotropical secondary forests. The uncertainty is based on 
the 95% confidence interval of the mean predicted AGB after 20 years (see 
Fig. 3 and Methods). It is expressed as a percentage of the predicted AGB: 
100 × (0.5 × 95% confidence interval of the mean)/predicted AGB. 
In general the uncertainty is low: 80.32% of the mapped area has 
an uncertainty less than 20%, and 10.2% of the mapped area has an 
uncertainty between 20% and 30%. Because it is a relative uncertainty, it is 
highest in the driest areas, which have a low predicted biomass. 
Extended Data Figure 4 | Relationship between forest biomass and 
stand age using chronosequence studies in Neotropical secondary 
forest sites. a, AGB (N = 44); b, AGB recovery (N = 28). The same 
as Fig. 1 but with plots and regression lines coloured by forest type: 
green, dry forest (<1,500 mm rainfall per year); light blue, moist forest 
(1,500-2,499 mm yr⁻¹); dark blue, wet forest (>2,500 mm yr⁻¹). Each line 
represents a different chronosequence. The original plots on which the 
regression lines are based are shown (N = 1,364 for AGB, N = 995 for AGB 
recovery). AGB recovery is defined as the AGB of the secondary forest 
plot compared with the median AGB of old-growth forest plots in the area, 
multiplied by 100. Significant relations (two-sided P < 0.05) are indicated 
by continuous lines, non-significant relationships (two-sided P > 0.05) are 
indicated by broken lines. Plots of 100 years old are also second-growth. 
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Extended Data Figure 5 | Potential biomass recovery map of 
Neotropical secondary forests. The same as Fig. 3 but with colour- 
blind-friendly colour coding. The total potential AGB accumulation 
over 20 years of lowland secondary forest growth was calculated 

on the basis of a regression equation relating AGB with annual 
rainfall (AGB = 135.17 — 103,950 x 1/rainfall + 1.522 x rainfall 


seasonality + 0.1148 x CWD; see Methods). The colour indicates the 
amount of forest cover recovery (purple, low recovery; green, high 
recovery). The 44 study sites are indicated by circles; the size of the 
symbols scales with the AGB attained after 20 years. The grey areas do not 
belong to the tropical forest biome. The map focuses on lowland tropical 
forest (altitude <1,000m). 
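As an illustration only, the regression equation in this caption can be written as a small Python function. The function and argument names are hypothetical. The sketch assumes that rainfall and CWD are given in mm per year and that rainfall seasonality is the RS value listed in Extended Data Table 1; the example inputs are the Bolpebra values from that table.

```python
def predicted_agb_20yr(rainfall, rainfall_seasonality, cwd):
    """Predicted AGB (Mg/ha) after 20 years, using the caption's coefficients.

    rainfall: annual rainfall (assumed mm per year)
    rainfall_seasonality: seasonality index (assumed to be the RS column)
    cwd: climatic water deficit (assumed mm per year, negative values)
    """
    return (135.17
            - 103_950 * (1.0 / rainfall)
            + 1.522 * rainfall_seasonality
            + 0.1148 * cwd)


# Bolpebra (Bolivia): rainfall 1800, RS 55, CWD -301
bolpebra = predicted_agb_20yr(1800, 55, -301)
print(round(bolpebra, 1))
```

The regression gives a climate-based prediction for a location; it is not expected to reproduce the AGB measured at any single site.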
Extended Data Table 1 | Overview of the sites included in the study

Site Country Lat. Long. Rainfall CWD RS CEC FC LU AGB %AGB Age Ref.

Bolpebra Bolivia -11.44 -69.16 1800 -301 55 6.4 80 SC 193.2 63.5 4-47 44
El Tigre Bolivia -11.98 -65.72 1780 -375 66 5.8 90 SC 145.0 32.3 3-25 45
El Turi Bolivia -11.75 -67.33 1833 -288 58 5.8 95 SC 129.4 47.8 2-40 45
Salvatierra Bolivia -15.50 -63.03 1200 -642 61 4.0 70 SC 33.3 29.5 1-36 46
San Lorenzo Bolivia -16.70 -61.87 1129 -718 59 4.5 40 SC 82.8 64.5 3-50 47
Surutu 1 Bolivia -17.50 -63.50 1600 -496 54 6.1 45 SC 60.5 - 4-20 48
Surutu 2 Bolivia -17.50 -63.50 1600 -496 54 6.1 45 SC 154.3 - 6-25 48
Bahia Brazil -14.48 -39.09 2000 -7 17 6.6 80 SC 96.7 25.4 10-40 49
Cajueiro Brazil -14.98 -43.90 840 -907 96 13.8 80 PA 52.2 - 15-59
Serra do Cipo Brazil -19.35 -43.62 1519 -341 86 5.4 60 PA 130.3 - 8-50
Eastern Para 1 Brazil -4.26 -47.73 1898 -548 88 7.8 50 SC & PA 168.2 - 5-25 50
Eastern Para 2 Brazil -2.16 -47.38 2460 -274 64 4.6 52 SC & PA 224.7 - 5-40 50
Eastern Para 3 Brazil -1.17 -47.75 2785 -306 68 4.3 58 PA 176.5 - 2-70 50
Manaus (Cecropia pathway) Brazil -2.35 -60.00 2400 -16 32 2.0 100 SC 181.2 55.8 5-31 51
Manaus (Vismia pathway) Brazil -2.35 -60.00 2400 -16 32 2.0 100 PA 140.6 43.3 2-25 51
Mata Seca Brazil -14.86 -43.99 825 -914 95 3.5 80 PA 58.8 - 14-58 52
Patos Brazil -7.02 -37.25 750 -1276 116 11.2 60 SC & PA 20.3 - 22-64 ee
Rio Madeira (anthropogenic soil) Brazil -5.77 -61.43 2507 -109 46 3.4 79 SC 187.9 74.4 5-30 54
Rio Madeira (oxisol) Brazil -5.77 -61.43 2507 -109 46 4.7 79 SC 159.2 56.2 6.5-30 54
Sao Paulo Brazil -22.32 -47.57 1367 -297 72 10.2 31 PA 123.2 35.7 11-45
Araracuara Colombia -0.60 -72.37 3059 -13 29 9.7 95 SC 134.5 39.2 2-30 55
Providencia Island Colombia 13.35 -81.35 1584 31.8 60 SC & PA 135.9 - 6-56 56
San Carlos Colombia 1.93 -67.05 3565 0 23 5.8 80 SC 84.6 35.5 9-80 57
Santa Rosa (oak forest) Costa Rica 10.89 -85.60 1765 -608 91 32.4 70 PA 161.3 - 5-70 58
Palo Verde Costa Rica 10.36 -85.31 1444 -622 88 39.0 70 PA 142.9 - 7-60 58
Santa Rosa Costa Rica 10.86 -85.61 1765 -621 93 32.4 70 PA 170.9 84.9 6-70 58
Sarapiqui (Chazdon) Costa Rica 10.43 -83.98 4000 0 29 29.0 7 PA 171.5 59.6 10-41 59
Sarapiqui (Letcher) Costa Rica 10.43 -83.98 4000 0 29 29.0 wT PA 145.4 76.6 10-42 60
Chajul Mexico 16.07 -90.75 3000 -248 62 10.8 50 SC 105.3 59.4 0-27 61
Chamela Mexico 19.50 -105.05 788 -905 109 21.0 60 PA 34.8 49.9 3-15 62
Chinantla Mexico 17.75 -96.65 3700 -596 63 10.1 95 SC 155.7 63.8 5-50
El Ocote 1 Mexico 16.96 -93.63 1500 -558 69 26.3 80 SC 101.4 33.6 2-75 63
El Ocote 2 Mexico 16.96 -93.63 2000 -558 69 18.6 80 SC 116.4 62.3 3-75 63
Lacandona Mexico 16.54 -90.96 2500 -325 68 37.0 87 SC & PA 206.3 - 1-31
Marqués de Comillas Mexico 16.33 -90.67 2250 21.4 49 SC & PA 206.3 - 2-50
Nizanda Mexico 16.66 -95.01 878 -1036 102 30.6 64 SC 34.7 25.4 2-60 64
Quintana Roo Mexico 19.31 -88.57 1250 -508 54 64.6 85 SC & PA 90.0 66.4 2-100
Yucatan Mexico 20.08 -89.55 1100 -601 65 39.9 94 SC 69.3 66.7 3-70 65
Zona Norte Mexico v.27 -91.66 2750 -213 56 37.0 67 SC & PA 85.6 - 2-52
Agua Salud Panama 9.22 -79.78 2700 -326 57 9.3 70 SC & PA 81.8 - 2-31 66
Barro Colorado Island Panama 9.15 -79.85 2600 -319 57 9.3 100 SC & PA 120.5 59.3 20-100 67
Pucallpa Peru -8.50 -74.80 1570 -56 34 21.4 43 SC & PA 99.2 - 5-30
Cayey Puerto Rico 18.02 -66.08 2000 -220 41 23.9 50 PA 110.5 61.8 10-80 68
El Carite Puerto Rico 18.08 -66.07 1822 -165 35 20.5 1 PA 127.7 66.7 4-77 69
Luquillo Puerto Rico 18.34 -65.76 2660 -12 31 9.5 ee] PA 150.1 67.8 9-76 69


An overview of the name, country, latitude (Lat.) and longitude (Long.), rainfall (in millimetres), rainfall seasonality (RS), CWD (in mm per year), CEC (in centimoles of positive charge per kilogram
of soil), percentage forest cover in the landscape matrix (FC; as a percentage), prior land use (LU: SC, shifting cultivation; SC & PA, some plots shifting cultivation, some plots pasture; PA, pasture),
AGB (in megagrams per hectare) after 20 years, relative biomass recovery after 20 years (%AGB, as a percentage), the minimum and maximum secondary forest age (in years) included in the
chronosequence, and a reference (refs 44-69) for each site is given.

Extended Data Table 2 | Overview of the modelling results of absolute
(N = 43, one site was excluded because of missing climatic data) and
relative (N = 28) AGB recovery after 20 years in relation to rainfall,
CEC, land use, and forest cover in the landscape matrix

Variable Model ΔAICc R²

AGB at 20 yr (Mg/ha) 1/rainfall + CWD + Rainfall seasonality 0.00 0.59
AGB at 20 yr (%) 1/rainfall + CEC 0.00 0.23
1/rainfall + CEC + Rainfall seasonality 1.53 0.27

All models within two AICc units of the best model are given. The change in AIC for small sample sizes
compared with the best model (ΔAICc) and R² are given.




LETTER 


doi:10.1038/nature16510 


New geological and palaeontological age constraint
for the gorilla–human lineage split


Shigehiro Katoh, Yonas Beyene, Tetsumaru Itaya, Hironobu Hyodo, Masayuki Hyodo, Koshi Yagi, Chitaro Gouzu,
Giday WoldeGabriel, William K. Hart, Stanley H. Ambrose, Hideo Nakaya, Raymond L. Bernor, Jean-Renaud Boisserie,
Faysal Bibi, Haruo Saegusa, Tomohiko Sasaki, Katsuhiro Sano, Berhane Asfaw & Gen Suwa


The palaeobiological record of 12 million to 7 million years
ago (Ma) is crucial to the elucidation of African ape and human
origins, but few fossil assemblages of this period have been
reported from sub-Saharan Africa. Since the 1970s, the Chorora
Formation, Ethiopia, has been widely considered to contain
~10.5 million year (Myr) old mammalian fossils. More recently,
Chororapithecus abyssinicus, a probable primitive member of the
gorilla clade (ref. 6), was discovered from the formation. Here we report
new field observations and geochemical, magnetostratigraphic and
radioisotopic results that securely place the Chorora Formation
sediments to between ~9 and ~7 Ma. The C. abyssinicus fossils
are ~8.0 Myr old, forming a revised age constraint of the
human–gorilla split. Other Chorora fossils range in age from ~8.5 to
7 Ma and comprise the first sub-Saharan mammalian assemblage
that spans this period. These fossils suggest indigenous African
evolution of multiple mammalian lineages/groups between 10
and 7 Ma, including a possible ancestral-descendent relationship
between the ~9.8 Myr old Nakalipithecus nakayamai and
C. abyssinicus. The new chronology and fossils suggest that faunal
provinciality between eastern Africa and Eurasia had intensified
by ~9 Ma, with decreased faunal interchange thereafter.
The Chorora evidence supports the hypothesis of in situ African
evolution of the Gorilla-Pan-human clade, and is concordant
with the deeper divergence estimates of humans and great apes
based on lower mutation rates of ~0.5 × 10⁻⁹ per site per year
(refs 13-15).


The Chorora Formation type locality fauna (refs 3, 12) has been known as one
of few key windows revealing Late Miocene mammalian evolution in
sub-Saharan Africa. In 2006/07 the fossil ape C. abyssinicus (ref. 6) was
discovered at Beticha, a locality ~3 km from the type locality (Fig. 1). Because
Chororapithecus is considered a strong candidate for membership in
the gorilla clade, its age is important in temporally constraining the
gorilla–human species divergence. The probable cladistic position
of Chororapithecus (Extended Data Fig. 1) is based on a suite of derived
molar features shared with gorillas, thought to indicate an early phase
of increased herbivory.

The >10 Myr age of the Chorora Formation was proposed on the
basis of stratigraphic and geochronological frameworks established in
the 1970s. In the 1990s and 2000s, additional radioisotopic dating and
biochronological considerations refined this chronology. In 2007,
the Chororapithecus-bearing strata were correlated to the Chorora type
locality sequence and thereby thought to be older than 10 Ma (ref. 6).
Here we present chronostratigraphic data from new field research,
and show that the Chorora Formation sediments are predominantly
younger than 9 Ma. We also present an overview of the newly recovered
Chorora Formation fossils, which fill the gap between sub-Saharan
fossils that are either older than 9-9.5 Ma (refs 7, 8, 16) or younger than
7-7.4 Ma (refs 17, 18). The Chorora fossils thus provide new insights
into mammalian evolution in Africa, in a period crucial to elucidating
the emergence of the African ape and human clade.

The Chorora Formation extends northeast to southwest for >100 km
at the transition zone between the southern Afar Rift and the northern
Main Ethiopian Rift (Fig. 1). The formation overlies Miocene
basaltic lavas (Trap Series) ranging from >23 to 9 Ma, with
intercalated silicic units. Recent studies suggest that a topographically
low pre-rift landscape prevailed in this region until ~11 to 10 Ma
(refs 19, 20). The Chorora Formation accumulated in a marginal
graben formed along the emerging southeastern Afar Rift border fault
as extension between the Nubian and Somali plates accelerated in the
Late Miocene. We show that lacustrine and fluvial sedimentation of
the Chorora Formation occurred between 9.2 and ~7 Ma. Thereafter,
substantial rift floor subsidence occurred further northwest, as inferred
from the chain of volcanic centres and widely distributed volcanic flows
dated between ~7 and 5 Ma (refs 19-21).

Figure 1 | Location of Chorora Formation sedimentary exposures.
a, Digital elevation map of eastern Africa and Late Miocene faunal sites.
1, Chorora; 2, Lothagam; 3, Samburu Hills; 4, Nakali. b, Transition zone
of the Main Ethiopian Rift (MER) and the Afar Rift. Dotted line indicates
extent of Chorora Formation sediments. Yellow stars and circles are
5-7 Myr old volcanic centres (northeast to southwest: Afdem, Asabot
and Gumbi) and flows. Red square shows location of c. c, Chorora
Formation exposures and localities. a, b, Images from the National
Geophysical Data Center; c, air photograph provided by the Ethiopian
Mapping Agency.

Figure 2 | Summary of Chorora Formation fault block relationships.
a, Schematic comparison of hypotheses. Left: rhyolitic ignimbrite units
above and below the Chorora Formation sediments. Right: successive fault
blocks with single rhyolitic ignimbrite unit. b, Fault block relationships
at the type locality (upper) and Beticha (lower). Numbers are K-Ar ages
and errors (2σ) in millions of years. Weighted means are shown with
number of samples in parenthesis. Red dots, fossil horizons; dotted lines,
depositional hiatus.

The ~10.5 Myr age formerly attributed to the Chorora Formation
type locality fossils was based on stratigraphy and geochronology
conducted by previous researchers. At the type locality and adjacent
areas, sedimentary packages were assigned to ‘series 1’ and ‘series 2’ in
ascending stratigraphic order. The type locality fossils were collected
from a sandy unit of the ‘series 2’ sediments that was assumed to underlie
rhyolitic ignimbrite units dated to ~10 Ma (refs 1-3).

We focused on refining the chronostratigraphy of both the type
locality and the Chororapithecus-bearing Beticha locality. Because
the Beticha and type locality sections differ in the represented
sedimentary units, we investigated a range of localities in establishing a
more comprehensive Chorora Formation chronostratigraphic
framework. Despite variation among localities, we observed a distinct
fine-grained welded rhyolitic ignimbrite unit to consistently underlie the
Chorora Formation sediments. These localities occur as
northeast-southwest trending, variably (mostly antithetically) tilted fault blocks,
often resulting in the underlying resistant volcanic strata elevated
relative to the adjacent fault block (Fig. 2). At the type locality, we could
not identify a section where the rhyolitic ignimbrite unambiguously
overlies the Chorora Formation sediments. Instead, we observed
sections where the rhyolitic ignimbrite unit previously considered to
overlie the Chorora Formation sediments (hereafter RhyOv1) occurs
in fault contact with the uppermost ‘series 2’ sediments (Extended
Data Fig. 2). A possible second ignimbrite (RhyOv2) occurs ~20 m
topographically higher than RhyOv1 and forms the local ridge top
>200 m west of the main type locality exposures. We hypothesized
that both RhyOv1 and RhyOv2 are step-faulted occurrences of the
rhyolitic ignimbrite unit (RhyUn) that directly underlies the Chorora
Formation sediments. Previously, these flows yielded whole-rock
K-Ar ages of 9.84 ± 0.6 (RhyOv2), 10.1 ± 0.6 (RhyOv1) and
11.0 ± 0.9 (RhyUn) Ma (here and elsewhere errors are 2σ). To
verify these dates using current K-Ar techniques (see Methods), we
analysed chemically homogeneous 100-200 mesh (150-74 µm)
feldspar separates. The K-Ar ages, feldspar chemistry and remanent
magnetism were compared across samples to test the hypothesis that
RhyOv1, RhyOv2 and RhyUn are correlative.

We sampled the type locality RhyUn (n = 4, weighted mean age
of 9.18 ± 0.12 Ma) at the same section as previous workers, and the
RhyOv1 and RhyOv2 at exposures continuous with the previous
sample locations (Extended Data Fig. 2). These yielded K-Ar ages
of 9.36 ± 0.20 (RhyOv1, n = 3) and 9.31 ± 0.23 (RhyOv2, n = 2) Ma,
respectively, slightly older than, but statistically indistinguishable
from, the RhyUn dates (Extended Data Table 1). Feldspar chemistry
is comparable across sections and samples (Extended Data Fig. 3),
suggesting a single flow unit or closely spaced successive units. All
measured RhyUn, RhyOv1 and RhyOv2 samples (n = 5) exhibited
reverse magnetization. Our results indicate that the hypothesis of
RhyUn, RhyOv1 and RhyOv2 equivalency cannot be rejected. We also
investigated rhyolitic ignimbrite sections at Beticha that are
lithologically and stratigraphically indistinguishable from the type locality
sections (Fig. 2). We analysed three rhyolitic ignimbrite samples
from Beticha. These resulted in K-Ar ages (weighted mean age:
9.04 ± 0.13 Ma), feldspar chemistry and palaeomagnetism concordant
with the type locality RhyUn/Ov1/Ov2.
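One way to read "statistically indistinguishable" is that the difference between two ages is no larger than their combined 2σ uncertainty. The sketch below illustrates that reading with the weighted mean ages quoted above. The criterion (errors added in quadrature) and the function name are assumptions made for illustration; the text does not state which test the authors applied.

```python
import math


def ages_indistinguishable(age_a, err_a, age_b, err_b):
    """Return True if two ages agree within their combined uncertainty.

    Ages are in Ma; err_a and err_b are 2-sigma errors, combined in
    quadrature (an assumed criterion, not one stated in the text).
    """
    combined = math.hypot(err_a, err_b)
    return abs(age_a - age_b) <= combined


rhy_un = (9.18, 0.12)   # RhyUn, n = 4
rhy_ov1 = (9.36, 0.20)  # RhyOv1, n = 3
rhy_ov2 = (9.31, 0.23)  # RhyOv2, n = 2

print(ages_indistinguishable(*rhy_ov1, *rhy_un))
print(ages_indistinguishable(*rhy_ov2, *rhy_un))
```

Under this criterion both overlying units agree with RhyUn, consistent with the statement that their equivalency cannot be rejected.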

The weighted mean age of 12 RhyUn/Ov1/Ov2 and correlative
samples is 9.17 ± 0.08 Ma. However, the individual K-Ar ages varied from
8.76 ± 0.21 to 10.08 ± 0.45 Ma, suggesting uncertainties exceeding
experimental error. Step-heating ⁴⁰Ar-³⁹Ar analysis of feldspars
resulted in an older weighted mean age of 9.66 ± 0.22 Ma (n = 2)
(Extended Data Fig. 4 and Extended Data Table 1). Combined with
the consistently reverse magnetization of RhyUn/Ov1/Ov2, these dates
constrain the unit within Chron C4Ar (9.03-9.74 Ma) (Fig. 3).

The above results and other analyses enable construction of a
composite Chorora Formation stratigraphy across eight fossiliferous
localities (Fig. 3 and Extended Data Figs 5 and 6). At the type locality,
the >20 m thick ‘series 2’ section that stratigraphically overlies the
RhyUn consistently exhibits reverse polarity. A tuff from the middle
of this interval, just above the main vertebrate fossil horizon, yielded a
K-Ar age of 8.73 ± 0.21 Ma. This suggests that the long type locality
interval of reverse magnetism corresponds to Chron C4r-2r (8.25-8.70 Ma)
and that the type locality fossils are ~8.5 Myr old.
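The chron assignments above combine a radioisotopic age range with the observed polarity. A minimal sketch of that matching step follows, using only the two reverse-polarity chrons and the ages quoted in the text. The dictionary, the function name and the simple interval-overlap rule are illustrative assumptions, not the authors' procedure.

```python
# Reverse-polarity chrons quoted in the text, as (younger, older) limits in Ma.
REVERSE_CHRONS = {
    "C4Ar": (9.03, 9.74),
    "C4r-2r": (8.25, 8.70),
}


def candidate_chrons(age, err_2sigma, chrons=REVERSE_CHRONS):
    """List chrons whose age span overlaps age +/- 2-sigma error."""
    low, high = age - err_2sigma, age + err_2sigma
    return [name for name, (younger, older) in chrons.items()
            if low <= older and high >= younger]


# Reversely magnetized tuff in the 'series 2' section: 8.73 +/- 0.21 Ma
print(candidate_chrons(8.73, 0.21))
# Reversely magnetized rhyolitic ignimbrite, 40Ar-39Ar: 9.66 +/- 0.22 Ma
print(candidate_chrons(9.66, 0.22))
```

With these inputs the tuff overlaps only C4r-2r and the ignimbrite only C4Ar, which matches the correlations proposed in the text.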

At Beticha, a gravelly sand unit containing the Chororapithecus
fossils occurs in the upper portion of a >40 m thick sequence, just above
a pumiceous tuff dated to 8.02 Ma (average of 8.18 ± 0.15 by K-Ar
and 7.86 ± 0.10 by ⁴⁰Ar-³⁹Ar). An ~5 m predominantly silty interval
overlies this fossil-bearing unit. This is capped by a consolidated tuff
dated to 7.74 Ma (average of 7.67 ± 0.17 by K-Ar and 7.82 ± 0.11 by
⁴⁰Ar-³⁹Ar). A magnetic transition occurs just below the 8.02 Myr old
tuff. The overlying sediments are normally magnetized, suggesting
that this transition corresponds to the C4r-C4n boundary (8.072 Ma).
Interpolating between this and the capping 7.74 Ma age yields an 8.0 Ma
estimate for the Chororapithecus and associated fossils.

Figure 3 | Chronostratigraphy of the Chorora Formation. Schematic
summary across eight localities. The type locality and Gololcha sections
are shown in the same column. Red ovals, fossil horizons. Numbers are
radioisotopic dates and errors (2σ) in millions of years; weighted means
are shown with number of samples in parenthesis. The composite section
shows summary ages: asterisks indicate average K-Ar ages, or averages of
K-Ar and ⁴⁰Ar-³⁹Ar ages (using weighted means).

At the type locality, Beticha and elsewhere, the upper ‘series 2’ or
higher sediments are of variable lithology and age. At Chifara, a deep
‘series 1’ sequence is in fault contact with an upper tuffaceous/fluvial
sequence containing a fossil-bearing unit. The K-Ar ages and normal
magnetization suggest a 7.0-7.15 Ma age of the fossils. At three other
newly established localities (Gutosadeen, Odakora North and Teso
Tadecho), mammalian fossils occur in sands intercalated in floodplain
silts that overlie an ignimbrite with a K-Ar age of 7.57 ± 0.21 Ma and
underlie basalts with ages of 6.72 ± 0.16 and 6.94 ± 0.17 Ma (Fig. 3).

Finally, we tested the above-established Chorora Formation
chronostratigraphy by comparing the Chorora fossils with those of
preceding and succeeding eastern African assemblages, in particular those
of Nakali/Samburu Hills (9.9-9.5 Ma) and Lothagam Lower Nawata
Member (7.4-6.5 Ma) of Kenya (Extended Data Table 2). The
descriptive details of the Chorora Formation fauna are presented elsewhere.
The Chorora type locality and Beticha assemblages share archaic taxa
with Nakali/Samburu Hills, but also have distinctly derived elements.
Notable first appearances include a low-crowned primitive species of
hippopotamus at Beticha. This new taxon morphologically bridges
Kenyapotamus and the more advanced Hippopotaminae that
comprise the post-7.4 Ma hippopotamid diversity. A primitive-looking
large bovid premolar may represent an early bovin or boselaphin.
The ~8.5-8 Ma Chorora cercopithecid teeth include at least two
species of colobine and a cercopithecine. The latter appears to be a
small papionin, so far the earliest recognized record of Cercopithecinae.
The dominant colobine is larger than the 10-9 Myr old Kenyan
Microcolobus, and has conservative molar cusp notches as in Late
Miocene Eurasian Mesopithecus spp. Another taxon is similar in molar
size to Microcolobus tugenensis but differs in details. A third colobine
may be represented by a mandibular molar that is higher crowned
with a deep lingual notch well within modern colobine ranges of
variation. These new cercopithecid fossils suggest considerable hidden
diversity in the fossil-poor early to middle Late Miocene of Africa. The
Chorora hipparionins suggest that two lineages, a larger and smaller
species, existed and went through a mostly in situ African
transformation during the Late Miocene: from the ~9.5 Ma Samburu Hills
‘Cormohipparion’ aff. africanum (most likely two species), through
the Chorora hipparionins, to the post-7.4 Ma Eurygnathohippus
turkanensis and E. feibeli. The newly recognized upper Chorora fauna,
estimated at 7.5-7.0 Ma, shares more taxa with the Lothagam Lower
Nawata Member.

The Chorora fossils suggest that multiple indigenous African
mammalian lineages developed between ~9.5 and 7 Ma, including
hippopotamines, nyanzachoere suids, bovids, hipparionin equids and
cercopithecids. However, inter-relationships with Eurasia must also
be considered. In particular, the hipparionin evidence shows a parallel
appearance of large robust forms across continents: the Samburu Hills/
Chorora/Lothagam 9.6-7 Ma sub-Saharan African sequence and the
<9-7 Ma Siwalik Sivalhippus spp. It is probable that eastern Africa
had a closer bioprovincial relationship with southern Asia than with
western Eurasia, and that population and species lineage interchange
intermittently occurred between continents. In sub-Saharan Africa,
the immigrant hipparionins became the first to extensively exploit C4
grasses as early as ~9.6 Ma in Kenya and by ~8.5 Ma at Chorora.
The emerging diversity of African cercopithecid monkeys in the Late
Miocene implies that from such a pool of diversity only restricted
lineages actually extended into Eurasia, for example, the semi-terrestrial
colobine Mesopithecus (~8.5-7 Ma) of the Eurasian open-woodland
‘Pikermian paleobiome’ (~9.5-6.5 Ma). Finally, the early Alilepus
recorded at Chorora (most probably 7.5-7 Ma) indicates that the
‘leporid event’ penetration into Africa might have occurred
near-simultaneously with southern Asia (recorded at 7.4 Ma). These
Afro-Eurasian links suggest an increasingly filtered faunal exchange after
~9 Ma (refs 9-12).

The Chororapithecus-bearing Beticha fauna is distinct from that
of the type locality in taxonomic relative abundances, exhibiting
clear dominance of primates, hippopotamids and browsing ungulates.
Enamel isotopic analyses indicate that these early hippopotamids (most
abundant next to primates at Beticha) appear to have been near-pure
C3 feeders (δ¹³C < −12.0; Extended Data Table 3). Combined with the
very low δ¹⁸O values of the two individuals, the data suggest that the
primitive Beticha hippopotamids fed in humid, closed habitats that
lacked C4 plants. Although more evidence is needed, it is probable
that, especially after ~9 Ma, faunal interchange between Eurasia and
sub-Saharan Africa was increasingly limited, with potential routes not
conducive to dispersal of closed-habitat-dependent species lineages,
including those of primates.

Our new age estimate of Chororapithecus at ~8 Ma revises its
chronocladistic relationship with Nakalipithecus (Extended Data Fig. 1).
It provides a minimum age of the gorilla–human species lineage
split at either ~8 or ~10 Ma, depending on the cladistic position of
Nakalipithecus. The Kenyan N. nakayamai lacks some of the derived
molar features of C. abyssinicus, but shares with C. abyssinicus large
postcanine size and possibly a prominent transverse crest in the lower
molars (inferred from wear pattern). Thus, N. nakayamai may represent
a stem member of the gorilla clade more primitive than C. abyssinicus,
or precede the gorilla split as previously suggested.

Until recently, many researchers preferred a relatively recent
timing of modern great ape and human divergences, a 4-6 Ma split
for humans and chimpanzees and a <8 Ma split for humans and
gorillas. However, genomic modelling of modern great ape and
human divergence timings based on a comparatively low
intergenerational mutation rate (refs 13-15) is in accord with the fossil evidence. The
Chorora evidence effectively refutes a 9-10 Ma Eurasian origin of the
African ape and human clade, and highlights the need for more fossils,
especially those of large-bodied apes from the 12 to 7 Ma time interval
of sub-Saharan Africa.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Lithostratigraphic and tephrostratigraphic correlations. The Chorora Formation
sediments and volcanic rocks were mapped and described in the post-2010 field
surveys at the following localities: type locality, Beticha, Gololcha, Chifara, Adadi,
Gutosadeen, Odakora North and Teso Tadecho. Synthesized lithostratigraphic
columns were constructed for each locality and correlations between localities
were established (Extended Data Figs 5 and 6). The type locality and Beticha
sediments are dominated by diatomaceous clay, silt and intercalated tuffs. At Gololcha
and the lower Chifara sections, coarser tuffs and pyroclastic flows dominate, with
intercalated partly diatomaceous silt and sand beds. These were compared with
previous depictions of the type locality and nearby stratigraphy. At Adadi and
Gutosadeen, stratigraphically higher sediments unconformably overlie the
sediments correlative to the type locality and Beticha lower beds.

Tephrostratigraphy of the Chorora Formation tuffs was compiled on the basis
of lithology (thickness, colour, grain size, unit division), stratigraphic
relationships and petrographic properties (including morphology and refractive indices
of volcanic glass shards). Probable correlative tephra layers were established by the
analytical results (Supplementary Table 1). Eight tuffs (CT-1 to CT-8) were
confirmed to be correlative by chemical fingerprinting of discrete volcanic glass shards
(Supplementary Tables 1 and 2). Major element compositions were measured by an
electron-probe micro-analyzer and processed following established methods.
The coefficient of similarity values were determined using eight major element
oxides: SiO2, TiO2, Al2O3, FeO, MnO, CaO, Na2O and K2O. All coefficient of
similarity values calculated for correlative pairs exceeded 0.91 (Supplementary Table 1).
For two of the tephra (CT-5 and CT-6), after examining chemical homogeneity of
glass shards, trace element compositions of glass shards were measured by direct
current plasma optical emission spectrometry (DCP-OES) following established
methods (Supplementary Table 2). For three of the tephra (CT-4, CT-7 and
CT-8), 10 major and 21 trace elements were also analysed by inductively coupled
plasma optical emission spectrometry (ICP-OES) (Agilent 720ES ICP-OES by
W. K. Hart and J. P. Morton). One sample of tuff CT-5 (07T-2) was analysed in 2010
(DCP-OES) and again in 2014 (ICP-OES), along with in-house glass standards and
an international standard reference material, to establish method/instrument
comparability and overall analytical accuracy and precision (Supplementary Table 2).
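The text does not spell out how the coefficient of similarity is computed. A widely used definition in tephra correlation, often attributed to Borchardt and co-workers, averages the ratio of the smaller to the larger concentration over all oxides compared. The sketch below assumes that definition; the function name and the two glass compositions are hypothetical and are not data from this study.

```python
def similarity_coefficient(sample_a, sample_b):
    """Mean of min/max concentration ratios over the oxides in both samples.

    Assumes the commonly used tephra similarity coefficient; values are
    oxide concentrations in wt% and must be greater than zero.
    """
    oxides = sorted(sample_a.keys() & sample_b.keys())
    ratios = [min(sample_a[o], sample_b[o]) / max(sample_a[o], sample_b[o])
              for o in oxides]
    return sum(ratios) / len(ratios)


# Hypothetical glass compositions (wt%), for illustration only.
tuff_x = {"SiO2": 74.1, "TiO2": 0.20, "Al2O3": 12.3, "FeO": 2.10,
          "MnO": 0.08, "CaO": 0.50, "Na2O": 3.9, "K2O": 4.6}
tuff_y = {"SiO2": 73.8, "TiO2": 0.21, "Al2O3": 12.5, "FeO": 2.00,
          "MnO": 0.08, "CaO": 0.52, "Na2O": 4.0, "K2O": 4.5}

print(similarity_coefficient(tuff_x, tuff_y) > 0.91)
```

A value of 1 means identical compositions, so a threshold such as the 0.91 quoted above marks pairs that are chemically close across all eight oxides.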

The ICP-OES techniques were similar to those previously used for DCP-OES 
analyses with the following important distinctions. Major and trace elements were 
analysed simultaneously using a single dissolution of 50 mg of sample powder fused 
with 75 or 100 mg of lithium metaborate and dissolved in 125 ml of 1% nitric acid. 
The resulting solution had a low enough amount of total dissolved solids to be 
run without further dilution. Powders of 12-14 standard reference materials were 
used as external calibration standards, and were chosen to emphasize the range of 
compositions of the unknown materials. These were prepared the same way and 
fused at the same time as the unknowns. Samples and standards were analysed 
within 24h after fusion to minimize losses of Si and Zr. 

K-Ar dating. Fresh volcanic rocks (ignimbrites, lavas and tuffs) were collected for
K-Ar and ⁴⁰Ar-³⁹Ar dating. Sample preparation and K-Ar dating were conducted
following established methods. Feldspar separates of 60-100, 100-200, 150-200,
150-300 or 200-300 mesh size fractions (250-150 µm, 150-74 µm, 105-74 µm,
105-48 µm and 74-48 µm, respectively) were prepared for analysis and examined
to assess chemical homogeneity by electron-probe microanalysis following
established methods. Seven major element oxides, SiO2, TiO2, Al2O3, FeO, CaO, Na2O
and K2O, and two minor oxides, SrO and BaO, were analysed using at least 25
crystals for each sample. The results are shown in the Or-Ab-An (K-Na-Ca) diagrams
(Extended Data Fig. 3) and in Supplementary Table 3; the crystals analysed were
predominantly alkali-feldspars (orthoclase and anorthoclase) and minor
plagioclases in the felsic units, or plagioclase in the basalt lava samples. A higher
chemical homogeneity and finer-grained separates generally result in higher accuracy
of K-Ar ages, so that 100-200 or 150-200 mesh size fractions were preferentially
used, except for the basalt lava samples for which 150-300 or 200-300 mesh size
fractions were dated to further minimize the heterogeneity effect.

⁴⁰Ar-³⁹Ar dating. Single crystal step-heating ⁴⁰Ar-³⁹Ar ages of selected samples
were measured for comparisons with the K-Ar ages. Fresh coarse-grained
anorthoclase crystals (0.5-1.5 mm in size) were taken from the feldspar separate prepared
for the K-Ar dating. Ages of individual crystals were determined by stepwise
degassing using a near-infrared diode laser. Plateau ages were calculated from
fractions that spanned over 75% of the total released ³⁹Ar and overlapped each
other within 2σ error, except for two samples (10TC-2 and 10TC-42) in which
higher calcium phases caused age shifts.

At Beticha, units above and below the Chororapithecus fossil-bearing sands were
dated. Two samples of the overlying consolidated tuff were taken ~5 m apart at
(10TC-58) or near (10TC-1) the cliff face just north of the Chororapithecus discovery
area. From these two samples, a total of six individual crystals were analysed
(Extended Data Fig. 4). The step-heating single crystal ages ranged from 7.59 ± 0.24
(2σ) to 8.19 ± 0.26 Ma with a weighted mean age of 7.82 ± 0.11 Ma. Two samples of
the underlying pumiceous tuff (10TC-52 and 10TC-53) were taken ~30 m apart just
downslope and down section of the Chororapithecus fossil-bearing unit. From the
two samples, a total of six individual crystals were analysed (Extended Data Fig. 4).
The step-heating single crystal ages ranged from 7.70 ± 0.28 to 7.93 ± 0.24 Ma
with a weighted mean age of 7.86 ± 0.10 Ma. We also determined the step-heating
single crystal ⁴⁰Ar-³⁹Ar ages of the rhyolitic ignimbrite at Beticha (10TC-02) and
Gololcha (10TC-42). Two feldspar crystals were measured for each sample, yielding
plateau ages of one crystal each (10.20 ± 0.48 and 9.52 ± 0.24 Ma, respectively).
Magnetostratigraphy. Oriented block samples were taken from the Chorora 
Formation sediments and volcanic rocks following established methods‘®. Cubic 
subsamples were cut from weakly consolidated sediments with stainless and alu- 
minium saws and sand papers and put into cubic holders of 10 cm*. Columnar 
subsamples of about 10cm? were mechanically cored from the hard rock samples. 
Stepwise alternating field and thermal demagnetizations were applied to these 
subsamples in 15-17 steps (2.5-180 mT and 100-680 °C), using a 2G cryogenic 
magnetometer and a Barington MS2 susceptibility meter. After confirming the 
natural remanent magnetizations with a linear decay towards the origin, character- 
istic remanent magnetization directions were calculated using principle component 
analysis‘'. A maximum angle deviation of 11.0° for at least five successive steps was 
accepted as the criterion in determining characteristic remanent magnetization 
direction. Mean palaeomagnetic directions were calculated for each sample horizon 
on the basis of the characteristic remanent magnetization directions for declination, 
inclination, precision parameter (k) and radius of 95% confidence circle 
(α95) as well as virtual geomagnetic pole latitude and longitude (Supplementary 
Table 4). The mean palaeomagnetic directions had α95 values of less than 20° in most 
of the sample horizons. The results indicate an upward normal-reverse-normal 
polarity change within the Chorora Formation sediments (‘series 1’ and ‘series 2’ 
sediments of Fig. 3) and a reversed polarity of the underlying rhyolitic ignimbrite. 
The primate-fossil-bearing horizon at Beticha occurs just above the transition from 
reverse to normal polarity. Considering our K–Ar and ⁴⁰Ar–³⁹Ar ages, we interpret 
this transition as correlative to the boundary between C4r and C4n.2n, 8.072 Ma 
according to the Cande and Kent geomagnetic polarity timescale. 
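
The horizon-mean statistics named above (declination, inclination, k and α95) are conventionally obtained with Fisher statistics. The sketch below is an illustration under that assumption, not the authors' processing code; the function name and the example directions are hypothetical.

```python
import math


def fisher_mean(directions):
    """Fisher mean of (declination, inclination) pairs in degrees.

    Needs at least two directions that are not all identical.
    Returns mean declination, mean inclination, k and alpha95 (degrees).
    """
    n = len(directions)
    x = y = z = 0.0
    for dec, inc in directions:
        d, i = math.radians(dec), math.radians(inc)
        x += math.cos(i) * math.cos(d)  # north component
        y += math.cos(i) * math.sin(d)  # east component
        z += math.sin(i)                # down component
    r = math.sqrt(x * x + y * y + z * z)  # resultant vector length
    mean_dec = math.degrees(math.atan2(y, x)) % 360.0
    mean_inc = math.degrees(math.asin(z / r))
    k = (n - 1) / (n - r)  # precision parameter
    cos_a95 = 1.0 - (n - r) / r * ((1.0 / 0.05) ** (1.0 / (n - 1)) - 1.0)
    a95 = math.degrees(math.acos(max(-1.0, min(1.0, cos_a95))))
    return mean_dec, mean_inc, k, a95


# Hypothetical characteristic directions from one horizon
example = [(80.0, 45.0), (100.0, 45.0), (90.0, 35.0), (90.0, 55.0)]
dec, inc, k, a95 = fisher_mean(example)
print(f"D = {dec:.1f}°, I = {inc:.1f}°, k = {k:.1f}, α95 = {a95:.1f}°")
```

A tightly grouped horizon gives a large k and a small α95; the 20° bound mentioned in the text would be checked against the last returned value.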

Isotopic analysis of enamel. The results of carbon and oxygen stable isotope 
analysis of equid and hippopotamid tooth enamel are summarized in Extended 
Data Table 3, together with the enamel δ¹³C values of four previously reported 
hipparionin teeth⁴,²². We measured enamel δ¹³C and δ¹⁸O values on three of the 
newly collected hipparionin teeth from the type locality (~8.5 Ma) and two hip- 
popotamid fragments from Beticha (~8.0 Ma) following established methods’. 
Diagenetic carbonate was removed from powdered enamel by treatment for 4h 
with 0.1 M acetic acid. Results are expressed as parts per thousand (‰) difference 
(δ) from the ¹³C/¹²C and ¹⁸O/¹⁶O ratios of international standards (Pee Dee 
Belemnitella (PDB) fossil carbonate for carbon; PDB and Vienna Standard Mean 
Ocean Water for oxygen). 

Accurate estimation of past atmospheric CO₂ δ¹³C values is critical for reconstructing 
Late Miocene habitats and herbivore diets. The modern C₃ global average 
δ¹³C value, excluding closed canopy forests, is approximately −27‰ (ref. 44), and 
the pre-industrial average would have been −25.5‰. Estimates of Neogene air 
δ¹³C values based on marine carbonate and organic biomarker isotopic data are 
similar to preindustrial values***°. Air CO₂ concentrations are higher and δ¹³C 
values are lower in closed canopy forest floor microenvironments. Discrimination 
(Δ¹³C‰ = air δ¹³C‰ − plant δ¹³C‰) varies substantially with temperature, 
humidity, water stress, CO₂ concentration and functional type (evergreen gymnosperms 
discriminate less than deciduous angiosperms), ranging from 22.5‰ in 
evergreen warm mixed forests to 15.6‰ in xeric woodland/scrubland⁴⁷. Tropical 
habitats with mean annual rainfall ~500 to 1,500 mm have estimated average Δ¹³C 
values of ~19 to 22‰ (ref. 47). High CO₂ concentrations during the Palaeogene 
and early Neogene favoured C₃ plants, but decreasing concentrations led to the 
expansion of C₄ grasslands during the Late Miocene, beginning as early as ~10 Ma. 
Modern browsing species generally do not feed on gymnosperm leaves, so we 
assume that browsing herbivores consumed plants with higher Δ¹³C values, and 
thus more negative δ¹³C than the total biome average. If the Late Miocene atmospheric 
CO₂ average δ¹³C value was −6.5‰, and precipitation was 500–1,500 mm, 
then the average angiosperm plant Δ¹³C value may have been ~20‰, so the average 
Late Miocene C₃ δ¹³C values would have been −26.5‰. Isotopic enrichment 
from ruminant herbivore diet to tooth enamel carbonate of ~14‰ would result 
in δ¹³C values of approximately −12.5‰ and 2–3‰ for pure C₃ feeders (browsers) 
and pure C₄ feeders (grazers), respectively. Values between these end-members 
reflect linear proportions of C₃- and C₄-based foods. The canopy effect can shift 
C₃ browser enamel δ¹³C values to as low as −20‰ in the Congo forest“. 
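
The end-member arithmetic and the linear mixing described above can be sketched as follows. This is a minimal illustration using the values given in the text; the function names are hypothetical, and the C₄ end-member of 2.5‰ is an assumed midpoint of the quoted 2–3‰ range.

```python
def c3_enamel_endmember(air_d13c=-6.5, discrimination=20.0, enrichment=14.0):
    """Enamel d13C (per mil) expected for a pure C3 feeder.

    plant d13C = air d13C - discrimination; enamel = plant + diet-to-enamel enrichment.
    """
    plant_d13c = air_d13c - discrimination  # -26.5 per mil with the defaults
    return plant_d13c + enrichment          # -12.5 per mil with the defaults


def c4_fraction(enamel_d13c, c3_end=-12.5, c4_end=2.5):
    """Fraction of C4 plants in the diet from a linear two end-member mixing model.

    c4_end = 2.5 is an assumption (midpoint of the 2-3 per mil range in the text).
    The result is clamped to the interval [0, 1].
    """
    fraction = (enamel_d13c - c3_end) / (c4_end - c3_end)
    return min(1.0, max(0.0, fraction))


print(c3_enamel_endmember())  # -12.5
print(c4_fraction(-12.5))     # 0.0, a pure C3 feeder
print(c4_fraction(-5.0))      # 0.5, an even C3/C4 mix
```

Enamel values below the C₃ end-member, such as those attributed to the canopy effect, are reported as 0% C₄ by the clamp rather than as a negative proportion.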
Extended Data Figure 1 | Cladistic relationships of C. abyssinicus 
and N. nakayamai. C. abyssinicus is known from only nine teeth 
(or fragments)⁶ and N. nakayamai from a mandibular corpus with 
M1–M3, and ten other isolated teeth (excluding a possible antimere)⁸. The 
only informative dental elements shared by the two reported samples are 
the lower M3 and the lower M1 (the latter damaged in C. abyssinicus and 
considerably worn in N. nakayamai). This fragmentary evidence makes 
cladistic evaluations difficult. C. abyssinicus was considered to share the 
following derived combination of features with the modern gorilla: large 
postcanine size, upper molars buccolingually narrow and mesiodistally 
elongate, with a relatively long and mesiobuccally extending mesial 
protocone crest, reduced protoconule, and lower molars with a prominent 
anterior transverse crest. Following ref. 6, we consider Chororapithecus to 
be a basal member of the gorilla clade. When compared with the Middle 
Miocene examples of Kenyapithecini and Equatorini, C. abyssinicus and 
N. nakayamai share reduced cingula in both upper and lower molars, 
although ref. 8 pointed out that the lower M3 cingulum appears slightly 
better developed in N. nakayamai. The available N. nakayamai teeth 
include elements (not reported in C. abyssinicus) that are morphologically 
more derived than in Kenyapithecini/Equatorini, such as upper premolars 
that are relatively elongate mesiodistally (narrow buccolingually) and a 
lower P3 that is not as obliquely elongate and transversely compressed as in 
the Middle Miocene forms. On the basis of these observations, ref. 8 
preferred a stem modern African ape-human cladistic position of 
N. nakayamai (option 1). However, N. nakayamai also shares two possibly 
derived features with C. abyssinicus: large postcanine size (although 
slightly less so) and prominent transverse crest in the lower molars. The 
latter is inferred from the characteristic wear pattern of the holotype 
mandible lower M1, the dentine exposures of the lingual cusps that are 
buccolingually linear (figure 1 of ref. 8). These observations suggest that 
N. nakayamai may also be a stem member of the gorilla clade but more 
primitive than C. abyssinicus (option 2). 
Extended Data Figure 2 | Chorora Formation type locality. a, View 
north-northwest from the southeastern margin of the type locality. Yellow 
and red arrows show the rhyolitic ignimbrite units considered by previous 
workers to overlie the Chorora Formation sediments’ *. The ignimbrite 
unit (yellow arrows) adjacent to the sediment exposures has been 
interpreted to directly overlie the Chorora Formation sediments including 
the gravel unit that forms the resistant ledge along the western margin 
of the whitish lacustrine exposures. A second ridge top unit (red arrows) 
was considered to cap the entire sequence. RhyUn, RhyOv1 and RhyOv2 
indicate sampling localities of the present study for K–Ar dating. The 
RhyUn unit dips westwards and stratigraphically underlies the Chorora 
Formation sediments. The horizontally bedded light brown sediments 
are Middle Pleistocene in age and unconformably cap both RhyUn and 
Chorora Formation sediments. The yellow and red dotted lines indicate 
approximate positions of the step faults that resulted in uplifted RhyOv1 
and RhyOv2 exposures. b, View northeast at the RhyOv1 sampling spot. 
Yellow arrows point to the RhyOv1 rhyolitic ignimbrite unit forming a 
planar terrace. c, View west of the b RhyOv1 section (yellow arrows). Note 
that the rhyolitic ignimbrite unit terminates abruptly (dotted line) in fault 
contact with sediments previously considered to underlie the ignimbrite””. 
microanalysis of feldspar separates of the mesh size fraction that was used in the K–Ar dating of samples listed in Extended Data Table 1. The full data 
are shown in Supplementary Table 3. 
consolidated tuff unit collected <5 m apart. Samples 10TC-52 and 10TC-53 
are from the same pumiceous tuff collected ~30 m apart. The weighted 
means of these are shown in Fig. 3 and Extended Data Table 1. 


© 2016 Macmillan Publishers Limited. All rights reserved 




[Extended Data Figure 5 image: stratigraphic columns for the Type Locality, Beticha, Gololcha and Adadi sections, showing marker tuffs CT-1 to CT-8, dated sample positions with K–Ar ages (as listed in Extended Data Table 1), normal faults, fossiliferous layers, palaeomagnetic measurements (normal, reversed, intermediate and undetermined polarity) and lithology (sandy clay, fine sand, medium sand, gravel/pebbly sand, volcanic ash/pumice, ignimbrite/lava).] 
Extended Data Figure 5 | Chorora Formation stratigraphic columns 
of the type locality, Beticha and nearby localities. The sections were 
taken at exposures adjacent to or continuous with the sampling locations 
of the dated volcanic samples (coordinates tabulated in Extended Data 
Table 1). Radioisotopic dates (K–Ar ages) and samples correspond to 
those shown in Fig. 3 and Extended Data Table 1. Eight tephra units are 
considered to occur at multiple localities, and shown as marker tuffs CT-1 
to CT-8. The analytical details of the tuffs are presented in Supplementary 
Tables 1 and 2. CT-6 is identified at Adadi from lithologic, petrographic, 
glass morphology and refractive index analyses. Results of the remanent 
magnetism analysis are given in Supplementary Table 4. See Extended 
Data Fig. 6 for explanation of the lithological codes. 
Extended Data Figure 6 | Chorora Formation stratigraphic columns 
of the western localities. The sections were taken at exposures adjacent 
to or continuous with the sampling locations of the dated volcanic 
samples (coordinates tabulated in Extended Data Table 1). Radioisotopic 
dates (K–Ar ages) and samples correspond to those shown in Fig. 3 and 
Extended Data Table 1. Tephra unit CT-4 occurs at multiple localities. 
The analytical details of the tuffs are presented in Supplementary Tables 1 
and 2. Analytical results of the remanent magnetism analysis are given in 
Supplementary Table 4. 
Extended Data Table 1 | Summary of K–Ar and ⁴⁰Ar–³⁹Ar dating 

No.  Sample  Locality  Stratigraphic position  Material  Mesh size  K (wt%)  Radiogenic ⁴⁰Ar (10⁻⁸ cc STP/g)  Non-radiogenic ⁴⁰Ar (%)  K–Ar age (Ma)  ±  ⁴⁰Ar–³⁹Ar age (Ma)  ± 
1  12TC-5  Chifara  above "series 2"  Pm-tuff  #60-100  4.760±0.095  130.1±1.3  2.9  7.03  0.16 
     N 8.8817, E 40.3042 
2  12TC-7  Chifara  above "series 2"  Tuff  #60-100  3.224±0.064  90.4±0.9  6.0  7.21  0.16 
     N 8.8818, E 40.3052 
3  12TC-9  Chifara  above "series 2"  Tuff  #60-100  4.443±0.089  116.6±1.7  28.9  6.75  0.17 
     N 8.8817, E 40.3052 
4  15DA-4  Teso Tadecho  above "series 2"  Basalt  #200-300  1.243±0.025  32.49±0.45  23.9  6.72  0.16 
     N 8.8589, E 40.2548 
5  15DA-10  Gutosadeen  above "series 2"  Basalt  #200-300  1.318±0.026  35.59±0.53  31.1  6.94  0.17 
     N 8.9043, E 40.2930 
6  14D-5  Odakora North  above "series 2"  RI  #100-200  3.927±0.079  115.7±2.2  42.5  7.57  0.21 
     N 8.8757, E 40.2726 
7  10TC-1/58  Beticha  upper "series 2"  Pm-tuff  #100-200  3.905±0.078  116.5±1.3  95  7.67  0.17  7.82 (n=6)  0.11 
     N 8.8660, E 40.2948 
8a  10TC-52/53  Beticha  upper "series 2"  Pm-tuff  #150-200  3.218±0.064  101.9±1.5  30.8  8.14  0.20  7.86 (n=6)  0.10 
8b  same as above  #150-200  3.200±0.064  102.3±1.7  36.5  8.22  0.21 
     N 8.8655, E 40.2949 (10TC-52) 
     N 8.8654, E 40.2946 (10TC-53) 
9  10TC-21b  Type  middle "series 2"  Pm-tuff  #150-200  4.115±0.082  139.7±1.7  19.7  8.73  0.21 
     N 8.8900, E 40.3068 
10  10TC-54/11TC-6  Gololcha  middle "series 1"  Pyr  #150-200  4.075±0.082  143.4±1.5  10.7  9.05  0.20 
     N 8.8999, E 40.3120 
11  13D-6  Chifara  lower "series 1"  Pyr  #100-200  5.032±0.101  179.7±1.9  5.7  9.18  0.21 
     N 8.8806, E 40.3047 
12  13D-2  Type†  RhyUn  RI  #100-200  5.469±0.109  196.6±2.6  22.7  9.24  0.22 
13  13D-3  Type†  RhyUn  RI  #100-200  5.276±0.106  182.4±4.5  41.9  8.89  0.28 
14  14D-7  Type†  RhyUn  RI  #100-200  5.351±0.107  192.5±2.4  21.2  9.25  0.22 
15  15DA-8  Type†  RhyUn  RI  #100-200  5.024±0.100  181.7±4.5  53.4  9.29  0.30 
     N 8.8883, E 40.3071 (13D-2, 13D-3, 14D-7, 15DA-8) 
16  13D-1  Type  RhyOv1  RI  #100-200  5.461±0.109  191.9±4.1  46.0  9.03  0.26 
     N 8.8940, E 40.3074 
17  10TC-42  Gololcha  RhyOv1  RI  #100-200  2.073±0.041  81.3±3.2  68.5  10.08  0.45  9.52  0.22 
     N 8.9014, E 40.3122 
18  15DA-1  Gololcha  RhyOv1  RI  #100-200  4.511±0.090  167.4±6.4  72.9  9.54  0.41 
     N 8.9015, E 40.3123 
19  14D-6  Type  RhyOv2  RI  #100-200  4.994±0.100  178.8±3.4  42.0  9.20  0.25 
     N 8.8940, E 40.3042 
20  15DA-2  Type  RhyOv2  RI  #100-200  4.580±0.092  173.8±8.4  72.9  9.75  0.51 
     N 8.8892, E 40.3012 
21  10TC-2/11TC-4  Beticha‡  RhyUn correlative  RI  #100-200  3.800±0.076  138.9±2.4  38.0  9.40  0.25  10.20  0.48 
22  14D-2  Beticha‡  RhyUn correlative  RI  #100-200  5.367±0.107  182.9±2.3  19.9  8.76  0.21 
     N 8.8649, E 40.2953 (10TC-2/11TC-4, 14D-2) 
23  14D-1  Beticha  RhyUn correlative  RI  #100-200  5.455±0.109  192.6±2.1  12.0  9.07  0.21 
     N 8.8659, E 40.2929 
24  13D-7  Beticha  below 14D-1  Basalt  #150-300  0.370±0.007  14.85±0.7  71.3  10.31  0.51 
     N 8.8651, E 40.2928 

K–Ar age weighted means 
Nos. 8a–8b  Beticha  upper "series 2"  Pm-tuff (n=2)  8.18  0.15 
Nos. 12–15  Type  RhyUn  RI (n=4)  9.18  0.12 
Nos. 16–18  Type/Gololcha  RhyOv1  RI (n=3)  9.36  0.20 
Nos. 19–20  Type  RhyOv2  RI (n=2)  9.31  0.23 
Nos. 21–23  Beticha  RhyUn correlative  RI (n=3)  9.04  0.13 
Nos. 12–20  Type/Gololcha  RhyUn/Ov1/Ov2  RI (n=9)  9.25  0.09 
Nos. 12–23  Type/Gololcha/Beticha  RhyUn/Ov1/Ov2  RI (n=12)  9.17  0.08 


K–Ar and single crystal step-heating ⁴⁰Ar–³⁹Ar ages are in million years; weighted means are shown with number of crystals or samples in parentheses. RI, rhyolite ignimbrite; Pm-tuff, pumiceous tuff; 
Pyr, rhyolitic pyroclastics. RhyUn refers to the rhyolitic ignimbrite unit that stratigraphically underlies the Chorora Formation sediments at the type locality. RhyOv1 and RhyOv2 refer to type locality 
rhyolitic ignimbrite units that were previously considered to overlie the Chorora Formation sediments (see Fig. 2 and Extended Data Fig. 2). 

†Samples 13D-2, 13D-3, 14D-7 and 15DA-8 were taken at the same RhyUn section (shown in Extended Data Fig. 2); 13D-2 was taken 2 m above the other three samples; the last three were taken at 
near-identical positions. 

‡Samples 10TC-2/11TC-4 and 14D-2 were taken ~5 m from each other. 
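
The relationship between the K, radiogenic ⁴⁰Ar and age columns of Extended Data Table 1 can be illustrated with the conventional K–Ar age equation. The sketch below is not taken from the source: it assumes the widely used decay constants and ⁴⁰K abundance (the table does not state which values were applied), and the function and constant names are hypothetical.

```python
import math

# Assumed constants (conventional values; not stated in the table)
LAMBDA_TOTAL = 5.543e-10    # total decay constant of 40K, per year
LAMBDA_EC = 0.581e-10       # decay constant of 40K to 40Ar, per year
K40_FRACTION = 1.167e-4     # atomic fraction of 40K in natural potassium
K_ATOMIC_WEIGHT = 39.0983   # g per mol
MOLAR_VOLUME_STP = 22414.0  # cc per mol of gas at STP


def k_ar_age_ma(k_wt_percent, rad_ar40_1e8_cc_stp_per_g):
    """K-Ar age in Ma from K (wt%) and radiogenic 40Ar (10^-8 cc STP/g)."""
    mol_k40 = k_wt_percent / 100.0 / K_ATOMIC_WEIGHT * K40_FRACTION
    mol_ar40 = rad_ar40_1e8_cc_stp_per_g * 1e-8 / MOLAR_VOLUME_STP
    ratio = mol_ar40 / mol_k40
    years = math.log(1.0 + (LAMBDA_TOTAL / LAMBDA_EC) * ratio) / LAMBDA_TOTAL
    return years / 1e6


# Sample No. 1 (12TC-5): K = 4.760 wt%, radiogenic 40Ar = 130.1 x 10^-8 cc STP/g
print(f"{k_ar_age_ma(4.760, 130.1):.2f} Ma")
```

With these assumed constants the result for sample No. 1 should fall close to the tabulated 7.03 Ma; small differences may reflect rounding of the tabulated inputs or slightly different constants.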
Extended Data Table 2 | List of mammalian taxa recovered from the Chorora Formation, and comparison with the 9.5–9.9 Myr old Nakali/ 
Samburu Hills fauna and the 6.5-7.4 Myr old Lothagam Lower Nawata Member fauna 


Nakali and Samburu Hills Chorora ae Beticha aoe Lothagam Lower+Upper Nawata 
Primates 
Samburupithecus kiptalami (S) 
Nakalipithecus nakayami (N) Chororapithecus abyssinicus Rew 
Hominidae sp. (U) 

Microcolobus sp. (N) cf. Microcolobus Oo 
Colobinae sp. cf O Colobinae sp. A, sp. B (L+U) 
Cercopithecinae sp. Olarge  Parapapio lothagamensis (L+U) 


Cetartiodactyla 


Kenyapotamus coryndoni (N+S) 


cf. Kenyapotamus 


Hippopotaminae gen. et sp. nov. 


Hippopotaminae sp. 


Nyanzachoerus sp. large (N+S) 


Nyanzachoerus sp. 


Archaeopotamus spp. 


Nyanzachoerus tulotos (L+U) 


2Samotherium sp. Sivatheriini sp. Oo O Sivatherium sp. (?L) 
Paleotragus cf. germaini (N+S) Paleotragus sp. O O Paleotragus germaini (L+U) 
Boselaphini sp. small (N?+S) Boselaphini sp. small O O Boselaphini sp. small (L+U) 
cf. Bovini Bovini sp. (L+U) 
?Reduncini (S) Reduncini sp. cf O ae on Reduncini spp. (L+U) 


cf. Neotragini sp. 


Neotragini spp. (L+U) 


Perissodactyla 


“C.” aff. africanum (N+S) 


“Cormohipparion ” sp. 


small Hipparionini 


Hipparionini sp. small 


Paradiceros mukirii (S) 


Ceratotherium sp. 


Eurygnathohippus turkanensis (L+U) 


Eurygnathohippus feibeli (L+U) 


Ceratotherium sp. (L+U) 


Diceros sp. (N) 


cf. Dicerotini 


Diceros bicornis/praecox (U) 


small schizotheriinae 


Ancylotherium tugenensis 


Proboscidea 


Tetralophodon sp nov. (S) 


Elephantidae sp. (N) Stegotetrabelodon sp. Bom O? Ber Stegotetrabelodon orbus (L+U) 
Deinotherium sp. (N+S) Deinotherium sp. O Deinotherium bozasi (L+U) 
Carnivora 
?Herpestides afarensis oO 
Machairodontinae sp. (S) Machairodus cf. aphanistus Oo Lokotunjailurus emageritus (L+U) 
Felidae sp. large medium 


Percrocuta leakeyi (N) 


Percrocutidae sp. large 


Hyaenidae spp. (N+S) cf. Hyaenictis O Hyaenictis sp. (L+U) 
Lagomorpha 
Alilepus sp. Alilepus (L*) 
Rodentia 
Paraulacodus sp. (N+S) Paraulacodus johanesi O Paraulacodus sp. (L) 


Paraphiomys sp. (N+S) 


Paraphiomys chororensis 


Nakalimys lavocati (N) 


Nakalimys lavocati 


Paraphiomys chororensis (L) 


Gerbillinae (N) 


Gerbillinae sp. large cf. Abudhabia 


Bom 
O 
O 


Abudhabia sp. (U) 


Afromys guillemoti 


“Dendromus” (?Saccostomus ) spp. 


Praecomys kikiae 


aff. Stenocephalomys 


cf. Parapelomys 


cf. Tectonomys 


Xerus sp. 


Faunal list of Nakali and Samburu Hills adapted from refs 8, 16, and that of Lothagam from ref. 17. Only taxa that are related or phylogenetically close to the Chorora Formation taxa are listed. Grey 
shaded cells indicate taxa diagnostically distinct between the 9.5-9.9 Myr and 6.5-7.4 Myr old eastern African assemblages. Orange cells are Chorora Formation occurrences that suggest continuity 
with Nakali/Samburu Hills. Green cells are derived Chorora Formation taxa that indicate a biochronologically younger age than Samburu/Nakali and an older age than the Lothagam Lower Nawata 
Member. Blue cells indicate affinities between the Chorora Formation and Lower Nawata Member taxa. 

*Lower Nawata Member Alilepus at Lothagam comes from the 6.5-6.6 Myr old levels. 
Extended Data Table 3 | Values of δ¹³C and δ¹⁸O of Chorora Formation hipparionin and hippopotamid tooth enamel 


Specimen no.    Taxon                   Lab sample no.   Element                   δ¹³C‰ PDB   δ¹⁸O‰ VSMOW   δ¹⁸O‰ PDB 
CHO-TL 1        “Cormohipparion” sp.    KGTE 163         left upper M1             −3.7        32.7          1.8 
CHO-TL 6        “Cormohipparion” sp.    KGTE 162         left lower M1             −4.8        31.9          0.9 
CHO-TL 9        “Cormohipparion” sp.    KGTE 164         premolar/molar fragment   −5.3        33.3          2.3 
CHOR 1 (ave)    “Cormohipparion” sp.                     left lower P4             −6.3 
CHOR 2          “Cormohipparion” sp.                     left upper M1             −4.6 
CHOR 3 (ave)    “Cormohipparion” sp.                     left lower M1             −5.8 
CHOR 5          “Cormohipparion” sp.                     left lower M2             −5.1 
CHO-BT 18       Hippopotamidae          KGTE 166         molar fragment            −12.0       23.8          −6.9 
CHO-BT 30       Hippopotamidae          KGTE 165         molar fragment            −12.5       24.2          −6.5 


Values are in parts per thousand difference from ¹³C/¹²C and ¹⁸O/¹⁶O ratios of international standards: PDB fossil carbonate for carbon; PDB and Vienna Standard Mean Ocean Water for oxygen. 
CHOR 1 to CHOR 5 data are from ref. 4; (ave) indicates the average of three samples of the same tooth. The Chorora type locality equid δ¹³C results are discussed in ref. 22 and indicate a mixed 
feeding diet of ~35–60% C₄. The Beticha hippopotamid δ¹³C values indicate that these individuals ate virtually no C₄ plants. These values are lower than in the chronologically earlier (9.5–9.9 Myr old) 
hippopotamids (Kenyapotamus coryndoni) from Nakali (−9.7‰) and Samburu Hills (−6.0‰ or −5.6‰) (ref. 26), and much lower than in the later (6.5–7.4 Myr old) Hippopotaminae from the 
Lothagam Lower Nawata Member (−4.3‰) (ref. 26) and Toros Ménalla in Chad (−3.5‰) (ref. 49). 
A thalamic input to the nucleus accumbens 
mediates opiate dependence 


Yingjie Zhu¹, Carl F. R. Wienecke¹, Gregory Nachtrab¹ & Xiaoke Chen¹ 


Chronic opiate use induces opiate dependence, which is 
characterized by extremely unpleasant physical and emotional 
feelings after drug use is terminated. Both the rewarding effects 
of a drug and the desire to avoid withdrawal symptoms motivate 
continued drug use¹⁻³, and the nucleus accumbens is important 
for orchestrating both processes**. While multiple inputs to the 
nucleus accumbens regulate reward®”, little is known about the 
nucleus accumbens circuitry underlying withdrawal. Here we 
identify the paraventricular nucleus of the thalamus as a prominent 
input to the nucleus accumbens mediating the expression of opiate- 
withdrawal-induced physical signs and aversive memory. Activity in 
the paraventricular nucleus of the thalamus to nucleus accumbens 
pathway is necessary and sufficient to mediate behavioural aversion. 
Selectively silencing this pathway abolishes aversive symptoms 
in two different mouse models of opiate withdrawal. Chronic 
morphine exposure selectively potentiates excitatory transmission 
between the paraventricular nucleus of the thalamus and D2- 
receptor-expressing medium spiny neurons via synaptic insertion 
of GluA2-lacking AMPA receptors. Notably, in vivo optogenetic 
depotentiation restores normal transmission at these synapses and 
robustly suppresses morphine withdrawal symptoms. This links 
morphine-evoked pathway- and cell-type-specific plasticity in the 
paraventricular nucleus of the thalamus to nucleus accumbens 
circuit to opiate dependence, and suggests that reprogramming this 
circuit holds promise for treating opiate addiction. 

To systematically map brain regions that directly innervate the 
nucleus accumbens (NAc), we stereotaxically injected a rabies virus in 
which the viral glycoprotein was replaced by red fluorescent protein 
mCherry (RV-mCherry) into the medial shell of the NAc'®!". Besides 
well-characterized inputs to the NAc, such as the prefrontal cortex, 
ventral hippocampus and basolateral amygdala (BLA)*'”!* (Extended 
Data Fig. 1a), we also detected mCherry-expressing neurons in the 
paraventricular nucleus of the thalamus (PVT) (Fig. 1a and Extended 
Data Fig. 1b). This result was particularly interesting because although 
previous studies have suggested a potentially important role for the 
PVT in drug-seeking behaviour, its underlying circuitry mechanism 
remains unknown!**’, 

To characterize the PVT to NAc connection, we injected 
channelrhodopsin-2-expressing adeno-associated virus (AAV-ChR2) into the 
PVT, then prepared acute NAc slices containing ChR2-expressing 
terminals from the PVT’* (Extended Data Fig. 2a). Brief light stimu- 
lation (3-5 ms) elicited reliable firing of action potentials up to 20 Hz 
in ChR2-expressing PVT neurons (Extended Data Fig. 2b). The same 
stimulation also evoked robust α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic 
acid receptor (AMPAR)-mediated excitatory postsynaptic 
currents (EPSCs) in medium spiny neurons (MSNs), as they were 
blocked by bath application of the competitive AMPAR antagonist CNQX 
(Extended Data Fig. 2c). Light stimulation also evoked picrotoxin- 
sensitive inhibitory postsynaptic currents (IPSCs) (Extended Data 
Fig. 3b). Because the PVT contains few, if any, GABAergic neurons 
(Extended Data Fig. 3a), it is likely that these IPSCs were caused by 


feed-forward inhibition in a local NAc circuit. Consistent with this 
prediction, PVT activation-evoked IPSCs had a longer delay than that 
of the EPSCs, and the IPSCs were blocked by CNQX (Extended Data 
Fig. 3c, d). 

Activation of inputs from the prefrontal cortex, ventral hippocampus 
and BLA to the NAc is rewarding and drives self-stimulation behaviour®”. 
To directly assess the behavioural consequences of PVT→NAc 
circuit activity, we optogenetically activated this pathway in freely 
moving mice and examined their motivational valence using a real-time 
place preference assay (Fig. 1b, Methods and Extended Data 
Fig. 4). Strikingly, optogenetic activation of the PVT→NAc pathway 
reduced the time spent in the chamber paired with light stimulation 
(Fig. 1c). This indicates that, unlike other major inputs to the NAc, 
activation of the PVT→NAc pathway is aversive rather than rewarding. 
Figure 1 | In vivo optical activation of the PVT→NAc pathway evokes 
behavioural aversion. a, Left: cluster of retrogradely labelled cells was 
observed in the PVT 5 days after injection of RV-mCherry into the medial 
shell of the NAc (n = 7). Scale bar, 500 μm; inset shows the RV-mCherry 
injection site. Right: magnified image shows the morphology of labelled 
neurons in the boxed area. Scale bar, 50 μm. D, dorsal; L, lateral; 3V, third 
ventricle; DG, dentate gyrus. b, Schematics of in vivo manipulation of the 
PVT→NAc circuit in behaving animals. c, Representative real-time place 
preference tracks illustrate light-evoked behavioural aversion in ChR2-expressing 
mice (bottom, n = 10) but not in eGFP-expressing control 
mice (top, n = 8). d, Quantification of light-evoked aversion and how it is affected 
by intra-NAc pharmacological manipulations. Intra-NAc infusions of 
NBQX (AMPAR antagonist, 1.0 μg in 200 nl, n = 8) but not saline (n = 10), 
SCH23390 (D1R antagonist, 0.2 μg in 200 nl, n = 8) or raclopride (D2R 
antagonist, 0.3 μg in 200 nl, n = 8) abolished behavioural aversion evoked 
by optical stimulation of the PVT→NAc fibres. One-way analysis of 
variance (ANOVA) (F(4,37) = 29.61, P < 0.0001) followed by post-hoc 
Tukey’s test. ***P < 0.001. Mean ± s.e.m. 


¹Department of Biology, Stanford University, Stanford, California 94305, USA. 
Avoidance of the light-paired chamber was dependent on local glutamatergic 
but not dopaminergic transmission in the NAc. Intra-NAc 
infusion of the AMPAR antagonist NBQX abolished the avoidance 
behaviour, while intra-NAc infusion of the D1 receptor (D1R) antagonist 
SCH23390 or the D2 receptor (D2R) antagonist raclopride had no effect 
(Fig. 1d). These results demonstrate that the PVT→NAc pathway transmits 
negative valence, and reveal an input-specific mechanism driving 
motivated behaviour in the NAc. 

Since activation of the PVT→NAc pathway evoked an aversive 
response, this pathway could be instrumental for the negative symptoms 
of drug withdrawal. To test this, we selectively silenced the 
pathway in two different models of opiate withdrawal and examined 
withdrawal-induced physical signs and place aversion. We used 
optogenetic terminal silencing by injecting an archaerhodopsin-3 
(ArchT)-expressing AAV into the PVT and bilaterally implanting 
fibre guide cannulae in the NAc!” (Extended Data Fig. 4). Two weeks 
after surgery, mice were rendered opiate dependent via daily intraperitoneal 
(i.p.) injections of morphine in their home cage with doses 
escalating from 10 to 50 mg per kg body weight'® (Fig. 2a). Two hours 
after the final morphine treatment, we injected naloxone, a μ-opioid 
receptor antagonist (5 mg per kg body weight, i.p.), and confined the 
mice to one side of a two-compartment conditioned place aversion 
(CPA) training chamber’. This naloxone dose evoked strong negative 
somatic signs and robust avoidance of the withdrawal chamber in 
morphine-dependent mice, but not in drug-naive mice (Extended Data 
Fig. 5). Moreover, injection of naloxone in chronic morphine-treated 
mice elicited robust expression of c-Fos, a marker for recent neuronal 
activity, in the PVTᴺᴬᶜ projection neurons, which were labelled by 
injecting the retrograde tracer CTB-488 into the NAc (Fig. 2b, c and 
Extended Data Fig. 6a, b). Remarkably, constant bilateral optogenetic 
silencing of the PVT→NAc pathway during naloxone-precipitated 
withdrawal suppressed somatic signs of opiate dependence and learned 
place-aversion (measured 1 and 7 days after withdrawal) (Fig. 2d, e). 

Opiate withdrawal in humans often results from cessation or reduction 
of opiate use, rather than blockade of opioid receptors, and this 
process can also be modelled in mice”!’. Spontaneous opiate withdrawal 
also evoked expression of c-Fos in PVTᴺᴬᶜ projection neurons 
(Extended Data Fig. 6b, c). To measure aversive motivational states 
during spontaneous opiate withdrawal, mice were confined for 45 min 
to one side of a CPA training chamber 16 h after each morphine injection. 
After four training sessions, mice developed aversion to the withdrawal 
chamber. We employed inhibitory designer receptors exclusively 
activated by designer drugs (DREADD; AAV-hM4Di) to selectively 
silence the PVT→NAc pathway during each 45 min CPA training 
trial?®?! (Fig. 2f and Methods). Local infusion of clozapine-N-oxide 
(CNO, 3 μM) into the NAc before each CPA training session reduced 
the CPA score in AAV-hM4Di but not enhanced green fluorescent 
protein (eGFP)-expressing AAV (AAV-eGFP) transduced mice 
(Fig. 2g). PVT→NAc pathway silencing had no effect on morphine-induced 
locomotor activity, a measure of acute psychoactive drug 
effects’ (Fig. 2h). Interestingly, the PVT→NAc circuit has roles beyond 
controlling the expression of physical and emotional negative states 
after opiate withdrawal. Mild footshock and i.p. injection of LiCl also 
evoked robust expression of c-Fos in PVTᴺᴬᶜ projection neurons, and 
silencing the pathway during conditioning suppressed the ability of 
either stimulus to evoke behavioural aversion (Extended Data Fig. 7). 

Chronic opiate use causes profound neuroadaptive changes in the 
NAc”?3. The NAc comprises two major subtypes of MSN, defined 
by the expression of either D1 or D2 dopamine receptors (D1- or 
D2-MSNs). D1- and D2-MSNs are proposed to play opposing roles in 
mediating behavioural motivation and reward learning”*”*, and syn- 
aptic plasticity of these MSNs appears to be causally involved in behav- 
ioural adaptations to drug addiction and chronic pain states®””®. Since 
the PVT→NAc pathway transmits negative valence and mediates opiate 
withdrawal symptoms, we anticipated that chronic morphine exposure 
might cause plasticity of the PVT input selectively onto D2-MSNs. 
Figure 2 | The PVT→NAc pathway is required for expression of 
aversive withdrawal symptoms. a, Experimental timeline for b–e. 
b, Confocal images showing that naloxone-precipitated withdrawal 
induced robust expression of c-Fos (red) in the PVTᴺᴬᶜ projection neurons 
(green) that were retrogradely labelled by injection of CTB-488 into the 
medial shell of the NAc (n = 6). Left: scale bar, 100 μm. Right: magnified 
image shows the boxed area; scale bar, 50 μm. c, Percentage of PVTᴺᴬᶜ 
projection neurons expressing c-Fos. Naloxone (grey bar, n = 6) but not 
saline (white bar, n = 5) injection evoked significant expression of c-Fos 
in the PVTᴺᴬᶜ projection neurons. Mann–Whitney U-test. **P < 0.01. 
d, e, Quantification of naloxone-precipitated withdrawal behaviours 
and CPA score. Light stimulation in ArchT- (green bar, n = 10) but not 
eGFP- (white bar, n = 9) expressing mice during withdrawal significantly 
reduces the number of jump, rearing and tremor events (d) as well as the 
expression of CPA (e). Mann–Whitney U-test. *P < 0.05, **P < 0.01. 
f, CNO (3 μM) inhibits light-evoked synaptic current recorded from 
postsynaptic MSNs (n = 5). Inset shows example light-evoked EPSC traces 
before (black) and after (red) perfusion of CNO. Wilcoxon signed-rank 
test. *P < 0.05. Scale bar, 20 pA, 25 ms. g, Spontaneous opiate withdrawal-induced 
CPA was reduced by local infusion of CNO in hM4Di- (red, n = 8) 
but not eGFP- (black, n = 8) expressing mice, or local infusion of saline in 
hM4Di- (magenta, n = 8) expressing mice. One-way ANOVA (F(2,21) = 7.4, 
P < 0.01) followed by post-hoc Tukey’s test. *P < 0.05, **P < 0.01. h, Light 
stimulation has no effect on locomotor velocity in either saline- (n = 9, 
P = 0.57) or morphine- (n = 9, P = 0.5) injected animals. Wilcoxon 
signed-rank test. Mean ± s.e.m. 


To directly examine morphine-induced synaptic plasticity of the 
PVT input onto identified NAc MSNs, we prepared brain slices from 
transgenic animals expressing fluorescent proteins under the control 
of the D1R or D2R promoter, with PVT neurons infected with AAV- 
ChR2 (Fig. 3a). To reduce experimental variability between slices, 
we measured the ratio of light-evoked AMPAR-mediated EPSCs to 
N-methyl-D-aspartate receptor (NMDAR)-mediated EPSCs (AMPAR/
NMDAR ratio) in the same MSN. Consistent with our hypothesis, the 
escalating morphine regimen increased this ratio in D2-MSNs but 
not in D1-MSNs (Fig. 3b, c and Extended Data Fig. 8a, b). In con- 
trast, presynaptic release probability was not changed in either type 
of MSN, as measured by the paired-pulse ratio of light-evoked EPSCs 
Figure 3 | Morphine-induced potentiation at the PVT→D2-MSN
synapses. a, Image of a NAc slice from a D1-TdTomato and D2-eGFP
double transgenic mouse (n = 5). Scale bar, 50 μm. b, c, Example traces (b)
and quantification (c) of light-evoked EPSCs at −70 mV and +40 mV
show that chronic morphine treatment significantly increased the
AMPAR/NMDAR ratio in D2-MSNs (saline/morphine, n = 13/16 cells),
but not D1-MSNs (saline/morphine, n = 14/14 cells). Two-way ANOVA
(F(1,53) = 12.58, P < 0.001) followed by post-hoc Tukey’s test. ***P < 0.001.
For comparison, EPSC amplitudes are normalized to peaks at +40 mV.
Filled dots indicate the current amplitude used for calculating the
AMPAR/NMDAR ratio. Scale bar, 300 pA, 50 ms. d, e, Example traces
(d, left), I/V curve (d, right) and quantification (e) of light-evoked AMPAR
EPSCs at −70 mV, 0 mV and +40 mV show that morphine treatment
selectively increased the rectification index of AMPAR EPSCs in D2-MSNs
(saline/morphine, n = 7/10 cells), but not D1-MSNs (saline/morphine,
n = 8/9 cells). Two-way ANOVA (F(1,30) = 9.87, P < 0.01) followed by
post-hoc Tukey’s test. **P < 0.01. For comparison, amplitudes of AMPAR
EPSCs are normalized to peaks at −70 mV. Scale bar, 250 pA, 50 ms.
Mean ± s.e.m.


in morphine-dependent mice (Extended Data Fig. 9). These results 
suggested that chronic morphine treatment caused a change in postsyn- 
aptic function. One plausible mechanism of drug-induced postsynaptic 
change is the insertion of GluA2-lacking calcium-permeable AMPAR 
(CP-AMPAR) in the affected NAc synapses’’. Compared with other 
AMPARs, CP-AMPARs have a larger conductance and show inward 
rectification at positive membrane potentials. In morphine-treated 
mice, we observed a significant increase in the rectification index selectively
at PVT→D2-MSN synapses, but not at PVT→D1-MSN synapses
(Fig. 3d, e). Interestingly, the AMPAR/NMDAR ratio at BLA→NAc
synapses, which are part of a pathway that drives reward-seeking behaviour,
was not changed by the same morphine treatment (Extended
Data Fig. 8c–e). Collectively, these results indicate that the morphine
regimen strengthened the PVT input selectively onto D2-MSNs via 
synaptic insertion of CP-AMPARs. 

If morphine-induced potentiation of the PVT→D2-MSN synapses
is essential for the expression of opiate withdrawal symptoms, then
depotentiating these synapses in vivo to restore their normal transmission
should relieve those symptoms. To test this prediction, we
employed long-term depression (LTD)-based in vivo manipulation of
the PVT→D2-MSN synapses by photostimulating ChR2-expressing
PVT terminals in the NAc at 1 Hz for 15 min®””8 (Fig. 4a). We applied 
this in vivo optogenetic LTD induction protocol in chronic mor- 
phine-treated mice, then prepared NAc slices to record photocurrents 
on visually identified D1- and D2-MSNs. Indeed, this light treatment 
reduced the AMPAR/NMDAR ratio, and rectified the AMPAR current
at PVT→D2-MSN synapses to a level comparable to baseline
transmission, but did not change the paired-pulse ratio (Fig. 4b–e and
Extended Data Figs 8f, g and 9). Treating morphine-dependent mice
with this optogenetic LTD protocol 45 min before naloxone injection 
Figure 4 | In vivo optogenetic LTD induction restores normal
transmission at PVT→D2-MSN synapses and suppresses withdrawal
symptoms. a, Experimental timeline for b–e. b–e, In vivo 1 Hz
photostimulation successfully normalized morphine-induced changes
of AMPAR/NMDAR ratio and rectification index in D2-MSNs, but
had little effect on D1-MSNs. b, c, AMPAR/NMDAR ratio of D2-MSNs
(morphine/morphine + 1 Hz, n = 16/17 cells) and D1-MSNs (morphine/
morphine + 1 Hz, n = 14/14). Two-way ANOVA (F(1,57) = 13.86, P < 0.001)
followed by post-hoc Tukey’s test. ***P < 0.001. d, e, Rectification index
for AMPAR EPSCs in D2-MSNs (morphine/morphine + 1 Hz, n = 10/10)
and D1-MSNs (morphine/morphine + 1 Hz, n = 9/7). Two-way ANOVA
(F(1,32) = 4.3, P < 0.05) followed by post-hoc Tukey’s test. *P < 0.05. Scale
bars, 300 pA, 50 ms (b); 250 pA, 50 ms (d). f, Experimental timeline for g, h.
g, h, Quantification of withdrawal behaviours (g) and CPA score (h).
In ChR2- (blue bar, n = 8) but not eGFP- (white bar, n = 8) expressing
mice, in vivo 1 Hz stimulation suppressed naloxone-precipitated jumping,
rearing and tremor events (g), and suppressed CPA to the withdrawal
chamber (h). Mann-Whitney U-test. *P < 0.05, **P < 0.01. Mean ± s.e.m.


(Fig. 4f) reduced the immediate expression of withdrawal behavioural
symptoms and the aversive memory of the withdrawal chamber
(Fig. 4g, h). This 1 Hz optogenetic stimulation had no effect on the plasticity
at PVT→D1-MSN synapses (Fig. 4b–e). Together, these results
establish a causal link between plasticity at PVT→D2-MSN synapses
and the negative somatic and motivational states that accompany opiate
withdrawal.

Complementary to previous studies that highlight the contribution 
of the prefrontal cortex, BLA, and ventral hippocampus inputs to the 
NAc in mediating drug reward and their plasticity onto D1-MSNs after 
chronic drug exposure, here we show that the PVT input transmits negative
valence and that its plasticity at PVT→D2-MSN synapses is necessary
for the expression of aversive states associated with opiate withdrawal.
We further demonstrate that optogenetic restoration of normal synaptic
transmission at these synapses effectively relieves withdrawal symptoms.
Our optogenetic LTD protocol may inspire the development of
novel treatments for opiate addiction involving deep brain stimulation 
to induce plasticity at relevant synapses”’. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS

Animals. Male adult (5–12 weeks) C57BL/6, Drd1a-tdTomato, Drd2-eGFP and
Drd1a-tdTomato/Drd2-eGFP double BAC transgenic mice were used for all
experiments", All procedures were in accordance with the US National Institutes 
of Health (NIH) guidelines for the care and use of laboratory animals, and were 
approved by Stanford University’s Administrative Panel on Laboratory Animal 
Care. 

Surgery. Stereotaxic injections were performed on 5- to 8-week-old mice under 
ketamine and xylazine (100 mg per kg body weight and 10 mg per kg body weight, 
i.p.) anaesthesia using a stereotaxic instrument (BenchMARK Digital, Leica).
A small volume of concentrated virus solution was injected into the PVT (200 nl
AAV, bregma −1.4 mm; lateral 0.1 mm; ventral 3.0 mm, with a 4° angle towards the
midline) or medial shell of the NAc (500 nl RV-mCherry, bregma 1.1 mm; lateral
0.8 mm; ventral 4.6 mm) with a pulled glass capillary at a slow rate (100 nl min⁻¹)
using a pressure microinjector (Micro 4 system, World Precision Instruments).
The injection needle was withdrawn 5 min after the end of the injection. For mice
involved in behavioural experiments, a plastic fibre-optic cannula was placed
at least 500 μm above the medial shell of the NAc, and cemented onto the skull
using dental cement (Lang Dental Manufacturing). After surgery, a dummy was 
inserted to keep the guide cannula from getting clogged. Mice were allowed at least 
2 weeks to recover and to express the virus before behavioural training. Rabies 
virus-injected mice were used 5 days after injection. The injected AAV led to the 
expression of ChR2, ArchT or hM4Di in PVT neurons extending from bregma
−1.82 mm to −0.94 mm (Extended Data Fig. 4), which covered the area with the
highest density of PVT^NAc projection neurons and the highest density of neurons
expressing c-Fos in response to naloxone-precipitated withdrawal (Extended
Data Fig. 1b-e). AAVs used in this study were produced by the University of 
Pennsylvania vector core: AAV1.hSyn.eGFP.WPRE.bGH, 1.76 x 101% genomic 
copies per millilitre; AAV1.hSynp.hChR2(H134R)-eYFP.WPRE.hGH, 3.35 x 10° 
genomic copies per millilitre, AAV9.CAG.ArchT.GFP.WPRE.SV40, 7.62 x 10! 
genomic copies per millilitre; and by the University of North Carolina vector 
core: AAV8.hSynp.hM4Di-mCherry, 8.3 x 10'? genomic copies per millilitre. 
The G-deleted Rabies-mCherry virus (4.5 x 10° genomic copies per millilitre) 
was produced by the gene transfer, targeting and therapeutics core at the Salk 
Institute for Biological Studies. 

Behavioural assays. All mice used in behavioural assays were allowed to recover 
from surgery of AAV injection and cannula implantation for at least 3 weeks. Place 
preference training was performed in a custom-made two-compartment condi- 
tioned place preference (CPP) apparatus (30cm x 25cm x 20cm). All locomotor 
behaviours, including real-time place preference, morphine-induced locomotion,
and withdrawal-, LiCl- and footshock-induced place aversion, were recorded for
15-20 min at 30 frames per second with a camera controlled by custom tracking 
software running on MATLAB (MathWorks). CPA score was calculated by sub- 
tracting the time spent in the aversive stimulus-paired side of the chamber during 
baseline from the time spent in the same side of the chamber during the test. 
Real-time place preference assay*’. After connecting with optical fibre, mice 
infected with AAV-ChR2 or AAV-eGFP were placed in the CPP training apparatus 
for 15 min to assess their baseline place preference. During the test, we assigned 
the counterbalanced side of the chamber as the stimulation side, and placed the 
mice in the non-stimulated side to start the experiment. When the mouse crossed 
to the stimulated side of the chamber, it triggered 20 Hz laser stimulation (473 nm,
20 ms pulses, 7 mW mm⁻² at the NAc) until the mouse crossed back to the
non-stimulated side. Avoidance score was calculated by subtracting the time spent in
stimulation side during baseline (without light) from the time spent in stimulation 
side during the test (with light). 
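As an illustration only, the CPA score and avoidance score defined above can be sketched in Python. The tracking itself ran in custom MATLAB software that is not described further, so the function names, the 15 cm boundary between compartments and the per-frame position list are hypothetical. Only the 30 frames per second rate and the test-minus-baseline subtraction come from the text.

```python
FPS = 30  # frames per second, as stated in the Methods


def time_on_side_s(x_cm, boundary_cm=15.0, fps=FPS):
    """Seconds spent on the paired (or stimulated) side.

    x_cm: one tracked x position per video frame (hypothetical format).
    boundary_cm: assumed divider between the two compartments.
    """
    frames_on_side = sum(1 for x in x_cm if x > boundary_cm)
    return frames_on_side / fps


def place_score_s(baseline_x_cm, test_x_cm, boundary_cm=15.0, fps=FPS):
    """Time on the paired side during the test minus time during baseline.

    The same subtraction gives the CPA score (aversive stimulus-paired side)
    and the avoidance score (light-stimulation side).
    """
    test_s = time_on_side_s(test_x_cm, boundary_cm, fps)
    baseline_s = time_on_side_s(baseline_x_cm, boundary_cm, fps)
    return test_s - baseline_s


# Toy traces: 2 s on the paired side at baseline, 1 s during the test.
baseline = [20.0] * 60 + [5.0] * 30
test = [20.0] * 30 + [5.0] * 60
print(place_score_s(baseline, test))  # negative value = avoidance
```

With this sign convention a negative score means the animal spent less time in the paired side after conditioning.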

Naloxone-precipitated morphine withdrawal. Mice infected with AAV-ArchT 
or AAV-eGFP were allowed to freely explore both sides of a custom-made CPP 
training apparatus for 15 min to assess their baseline place preference. Then, these 
mice received a single daily injection of morphine (i.p.) for 6 consecutive days with 
doses escalating at 10, 20, 30, 40, 50, 50 mg per kg body weight in their home cage 
to develop morphine dependence. Two hours after the last morphine injection, 
mice received an injection of naloxone (5 mg per kg body weight, i.p.) and were 
confined in the counterbalanced side of the CPP chamber. During the withdrawal, 
a constant green laser (532 nm, 5 mW mm⁻² at the NAc) was delivered through
the cannula onto the NAc. We turned off the laser for 1 min every 5 min to avoid 
tissue damage*!. Withdrawal symptoms were recorded for 20 min before each 
mouse was returned to its home cage. Physical signs (jump, rearing and tremor) 
were manually quantified offline. One and 7 days after naloxone injection, with- 
drawal mice were re-exposed to the CPP chamber and allowed to explore both 
sides of the CPP chamber for 15 min. 

Spontaneous morphine withdrawal". Mice infected with AAV-hM4Di-mCherry 
or AAV-eGFP were allowed to freely explore both sides of a custom-made 
CPP training apparatus for 15 min to assess their baseline place preference. 
Then, these mice received daily injections of morphine (20 mg per kg body
weight, i.p.) in their home cage. Sixteen hours after each morphine injection,
mice received a local infusion of CNO (3 μM, 200 nl) or saline (200 nl) through
the cannula into the NAc. Twenty minutes later, each mouse was confined for 45 min in the
counterbalanced side of the training chamber. This entire procedure was repeated 
once per day for 4 days. Twenty-four hours after the final training session, mice 
were re-exposed to the CPP chamber and allowed to explore both sides of the 
chamber for 15 min. 

LiCl-induced CPA**. Mice infected with AAV-hM4Di-mCherry or AAV-eGFP 
were allowed to freely explore both sides of a custom-made CPP training apparatus 
for 15 min to assess their baseline place preference. Then, these mice were injected 
with saline (i.p.) and confined in the counterbalanced side of the training chamber 
for 1 h before return to home cage. Six hours later, the same mice received a
local infusion of CNO (3 μM, 200 nl) into the NAc 20 min before i.p. injection of
LiCl (300 mg per kg body weight) and confined in the other side of the chamber for 
60 min. This entire procedure was repeated once per day for 4 days. Twenty-four 
hours after the final training session, mice were re-exposed to the CPP chamber 
and allowed to explore both sides of the chamber for 15 min. 
Footshock-induced CPA**?3, On day 1, mice infected with AAV-ArchT or AAV- 
eGFP were allowed to freely explore both sides of a custom-made CPP training 
apparatus for 15 min to assess their baseline place preference. On the second day, 
after connecting with optical fibre, these mice were confined in the counterbalanced
side of the chamber for 15 min and received ten inescapable mild footshocks
(0.5 mA, 0.5 s with 90 s inter-stimulation interval) while constant green
laser (532 nm, 5 mW mm⁻² at the NAc) was delivered through the cannula onto
the NAc. Twenty-four hours later, mice were re-exposed to the CPP chamber and 
allowed to explore both sides of the CPP chamber for 15 min. 
Morphine-induced locomotion. Mice infected with AAV-ArchT were habitu- 
ated to the testing apparatus (36cm x 36cm x 20cm) for two daily 30 min sessions 
before being subjected to the behavioural test on day 3. On the test day, mice were 
connected with optical fibres and received a single injection of saline or morphine 
(30 mg per kg body weight, i.p.). Five minutes after injection, mice were placed 
in the testing apparatus for 15 min with laser stimulation (532 nm, 5 mW mm⁻²
at the NAc) alternating from off to on every 3 min. We compared the averaged 
locomotion velocity from two laser-on sessions with the mean locomotion velocity 
from three laser-off sessions. 
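The laser-on versus laser-off comparison can be sketched as follows. This is a minimal illustration, not the authors' analysis code: it assumes that a mean velocity has already been computed for each 3 min epoch, and all names are hypothetical. The 15 min session, the 3 min alternation and the off-first order are taken from the description above.

```python
from statistics import mean

SESSION_MIN = 15  # total time in the testing apparatus
EPOCH_MIN = 3     # laser alternates from off to on every 3 min


def epoch_labels(session_min=SESSION_MIN, epoch_min=EPOCH_MIN):
    """Laser state for each consecutive epoch, starting with laser off."""
    n_epochs = session_min // epoch_min
    return ["off" if i % 2 == 0 else "on" for i in range(n_epochs)]


def on_off_means(epoch_velocities):
    """Return (mean laser-on velocity, mean laser-off velocity).

    epoch_velocities: one mean velocity per epoch, in session order
    (hypothetical input format).
    """
    labels = epoch_labels()
    if len(epoch_velocities) != len(labels):
        raise ValueError("expected one velocity per epoch")
    on = [v for v, lab in zip(epoch_velocities, labels) if lab == "on"]
    off = [v for v, lab in zip(epoch_velocities, labels) if lab == "off"]
    return mean(on), mean(off)


print(epoch_labels())
print(on_off_means([4.0, 6.0, 5.0, 8.0, 3.0]))
```

A 15 min session split into 3 min epochs yields three laser-off and two laser-on epochs, which matches the session counts given in the text.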

In vivo optogenetic depotentiation. Two weeks after recovery from AAV-ChR2 or 
AAV-eGFP injection and cannula implantation surgery, mice received daily injec- 
tions of morphine (i.p.) for 5 days with escalating doses of 10, 20, 30, 40, 50 mg per 
kg body weight in their home cage. For slice recording, mice were photostimulated 
for 15 min with trains of 473 nm light (1 Hz, 4 ms, 900 pulses, 7 mW mm⁻² at the
NAc) 20 h after the final morphine injection. For naloxone-precipitated morphine
withdrawal, the same light stimulation was delivered 45 min before i.p. injection 
of naloxone. 
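A short arithmetic check of the stimulation parameters (1 Hz for 15 min, 4 ms pulses) is given below. It is a sketch for illustration, not the stimulation code used in the study, and the function name is hypothetical.

```python
def pulse_onsets_s(freq_hz=1.0, duration_s=15 * 60):
    """Onset time in seconds of each light pulse in a regular train."""
    n_pulses = round(freq_hz * duration_s)
    return [i / freq_hz for i in range(n_pulses)]


PULSE_WIDTH_S = 0.004  # 4 ms pulses, as stated in the Methods

onsets = pulse_onsets_s()
total_light_s = len(onsets) * PULSE_WIDTH_S
print(len(onsets), round(total_light_s, 3))
```

The train contains 900 pulses, in agreement with the stated protocol, and the light is on for only a few seconds in total across the 15 min.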

Electrophysiological recording. Procedures for preparing acute brain slices and 
performing whole-cell recordings with optogenetic stimulations were similar to 
those described previously***°. Coronal 250–300 μm slices containing the NAc
were prepared using a vibratome (VT-1000S, Leica) in an ice-cold choline-based
solution containing (in mM) 110 choline chloride, 2.5 KCl, 0.5 CaCl2, 7 MgCl2,
1.3 NaH2PO4, 1.3 Na-ascorbate, 0.6 Na-pyruvate, 25 glucose and 25 NaHCO3,
saturated with 95% O2 and 5% CO2. Slices were incubated in 32 °C oxygenated
artificial cerebrospinal fluid (in mM: 125 NaCl, 2.5 KCl, 2 CaCl2, 1.3 MgCl2,
1.3 NaH2PO4, 1.3 Na-ascorbate, 0.6 Na-pyruvate, 25 glucose and 25 NaHCO3) for
at least 1 h before recording. Slices were transferred to a recording chamber and
superfused with 2 ml min⁻¹ artificial cerebrospinal fluid. Patch pipettes (2–5 MΩ)
pulled from borosilicate glass (PG10150-4, World Precision Instruments) were
filled with a Cs-based low Cl⁻ internal solution containing (in mM) 135 CsMeSO3,
10 HEPES, 1 EGTA, 3.3 QX-314, 4 Mg-ATP, 0.3 Na-GTP, 8 Na-phosphocreatine,
290 mOsm kg⁻¹, adjusted to pH 7.3 with CsOH. Whole-cell voltage-clamp recording
was performed at room temperature (22–25 °C) with a Multiclamp 700B amplifier
and a Digidata 1440A (Molecular Devices). Data were sampled at 10 kHz
and analysed with pClamp10 (Molecular Devices) or MATLAB (MathWorks). 
TdTomato-expressing D1-MSNs or eGFP-expressing D2-MSNs in the NAc were 
visualized using an upright fluorescent microscope (Olympus BX51WI). A blue 
light-emitting diode (470 nm, Thorlabs) controlled by digital commands from the 
Digidata 1440A was coupled to the microscope with a dual lamp house adaptor 
(5-UL180, Olympus) to deliver photostimulation. To record light-evoked EPSCs, 
3-5 ms, 0.5-2 mW blue light was delivered through the objective to illuminate 
the entire field of view. To assess the efficiency of pharmacogenetic silencing of 
synaptic transmission, we first co-injected AAV-hM4Di and AAV-ChR2 into the 
PVT, then prepared NAc slices containing the ChR2 and hM4Di co-expressed 
terminals from the PVT. Light-evoked EPSCs were recorded from MSNs before 
and after perfusion with CNO (3 μM).
All experiments, except those in Extended Data Fig. 3, were performed in the
presence of picrotoxin (100 μM). For Extended Data Fig. 3, picrotoxin or CNQX
(10 μM) was perfused onto slices, and drug effects on light-evoked EPSCs/
IPSCs were measured by recording traces before and 5 min after drug perfusion.
Membrane potential was held at −70 mV to record AMPAR-mediated current, at
0 mV to record GABA(A) receptor-mediated IPSC or at +40 mV to record NMDAR-
mediated current. Amplitude of NMDAR current was quantified at 50 ms after
stimulus, when the contribution of the AMPAR component was minimal. AMPAR/
NMDAR ratios were calculated by dividing the amplitude of the AMPAR current
(peak current, at −70 mV) by the amplitude of the NMDAR current. The paired-pulse
ratio was calculated by dividing the second light-evoked EPSC by the first
with 100 ms intervals between the two. The AMPAR current was recorded at three
holding potentials (−70 mV, 0 mV and +40 mV) in the presence of D-AP5 (50 μM).
Rectification index ($i_r$) of the AMPAR was calculated on the basis of the following
equation: $i_r = (I_{-70}/70)/(I_{40}/40)$, in which $I_{-70}$ and $I_{40}$ were EPSC amplitudes
recorded at −70 mV and +40 mV, respectively.
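The three synaptic measures defined in this section can be written as a short Python sketch. It is illustrative only: the function names are hypothetical, and taking absolute values is an assumption made here so that inward (negative) and outward (positive) currents are both treated as amplitudes, as the word "amplitudes" in the text suggests.

```python
def ampar_nmdar_ratio(ampar_peak_pa, nmdar_50ms_pa):
    """AMPAR peak amplitude at -70 mV divided by the NMDAR amplitude,
    measured at +40 mV, 50 ms after the stimulus."""
    return abs(ampar_peak_pa) / abs(nmdar_50ms_pa)


def paired_pulse_ratio(first_epsc_pa, second_epsc_pa):
    """Second light-evoked EPSC divided by the first (100 ms apart)."""
    return abs(second_epsc_pa) / abs(first_epsc_pa)


def rectification_index(i_minus70_pa, i_plus40_pa):
    """i_r = (I_-70 / 70) / (I_40 / 40), using current amplitudes."""
    return (abs(i_minus70_pa) / 70.0) / (abs(i_plus40_pa) / 40.0)


# Toy values in pA, chosen for easy arithmetic (not data from the paper).
print(ampar_nmdar_ratio(-200.0, 100.0))
print(paired_pulse_ratio(-100.0, -80.0))
print(rectification_index(-70.0, 40.0))   # linear I/V relation
print(rectification_index(-140.0, 40.0))  # inward rectification
```

Under this definition a synapse with a linear current-voltage relation reversing at 0 mV has an index of 1, and a relatively smaller outward current at +40 mV raises the index above 1.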

CTB tracing and c-Fos histology. Two to five days after bilateral injection of
CTB-488 (0.2 μl, 0.5% in PBS) (Invitrogen, C-22841) into the medial shell of the
NAc (bregma 1.1 mm; lateral 0.8 mm; ventral 4.6 mm), mice underwent
naloxone-precipitated withdrawal, spontaneous withdrawal, footshock or LiCl treatment.
Ninety minutes after the aversive stimuli, animals were deeply anaesthetized and
perfused with 10 ml of saline (0.9%) followed by 10 ml of 4% paraformaldehyde
in PBS. Coronal brain sections (50 μm) were cut using a cryostat (Leica). Brain
sections (between bregma −1.2 and −1.9 mm) were first washed in PBS (3 × 10 min),
then blocked at room temperature with 10% NDS/0.3% Triton X-100 (PBST) and
then incubated with primary anti-c-Fos antibody (Santa Cruz, SC-52G, rabbit
polyclonal IgG, 1:2,000 dilution) for 3 days at 4 °C. Brain sections were washed
in PBS (3 × 10 min), followed by incubation for 2 h with fluorophore-conjugated
secondary antibody (1:1,000 in 5% NDS PBST) and finally counterstained with
Hoechst (1:10,000, ThermoFisher Scientific). Images were obtained using a Zeiss
510 confocal microscope or a Zeiss epifluorescence microscope, and analysed by 
an individual blind to the identity of experimental groups. 

Statistics. No statistics were used to predetermine sample size. However, our sam- 
ple sizes were similar to those reported in previous publications’ ~’. Statistical meth- 
ods are indicated when used, and statistical analyses of main figures are provided in 
Extended Data Table 1. All analyses were performed using MATLAB (MathWorks) 
or Prism. No method of randomization was used in any of the experiments. For 
ANOVA analyses, the variances were similar as determined by Brown-Forsythe 
test. Experimenters were not blind to group allocation in behavioural experiments, 
but CPA score and locomotion were measured automatically by custom tracking
software running on MATLAB (MathWorks). All animals that finished the entire 
behavioural training and testing were included in analysis. Data are presented as 
mean ± s.e.m.
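For readers who want to reproduce the summary statistic, a minimal Python sketch of mean ± s.e.m. follows. The analyses in the paper were run in MATLAB or Prism; this helper is a hypothetical equivalent that uses the sample standard deviation (n − 1 denominator).

```python
from math import sqrt
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, standard error of the mean) for a list of numbers."""
    n = len(values)
    if n < 2:
        raise ValueError("s.e.m. needs at least two observations")
    return mean(values), stdev(values) / sqrt(n)


# Toy values, not data from the paper.
m, sem = mean_sem([1.0, 2.0, 3.0, 4.0])
print(f"{m:.2f} ± {sem:.2f}")
```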
Extended Data Figure 1 | Retrograde tracing with rabies virus labels
brain areas innervating the NAc and naloxone-precipitated opiate
withdrawal induces c-Fos expression in the PVT. a, Representative
images show the injection site in the medial shell of the NAc, and
retrograde labelling with mCherry in the ventral hippocampus (vHipp),
prelimbic cortex (PrL) and basolateral amygdala (BLA). b, Representative
images show retrograde labelling with mCherry in the PVT, ranging from
bregma −0.4 to −2.2 mm. c, Distribution of the density of retrogradely
labelled neurons in the PVT (n = 7). d, Representative images show
c-Fos expression in the PVT induced by naloxone-precipitated opiate
withdrawal. e, Distribution of the density of c-Fos-expressing neurons
in the PVT (n = 5). Areas of the PVT at different anterior–posterior
coordinates are determined on the basis of the brain atlas published by the
Allen Institute for Brain Science (available from
http://mouse.brain-map.org/experiment/thumbnails/100142143?image_type=atlas).
Scale bars, 500 μm. Mean ± s.e.m.
Extended Data Figure 2 | Optogenetic targeting of glutamatergic
transmission between the PVT and the NAc. a, A confocal image shows
ChR2-eYFP- (enhanced yellow fluorescent protein) expressing terminals
from the PVT in the medial shell of the NAc. Inset, injection site in
the PVT. Scale bars, 500 μm. b, Example traces (left) and quantification
(right) of action potential firing evoked by light stimulation at different
frequencies in ChR2-expressing PVT neurons (n = 7). Scale bar, 25 mV,
250 ms. c, Example traces (left) and quantification (right) of photo-evoked
EPSCs recorded from the NAc before and after superfusion of CNQX
(10 μM, n = 6). Scale bar, 50 pA, 25 ms. Wilcoxon signed-rank test.
*P < 0.05. Mean ± s.e.m.
Extended Data Figure 3 | Optogenetic activation of the PVT
input evoked feed-forward inhibition onto MSNs in the NAc.
a, In situ hybridization with GAD1 (http://mouse.brain-map.org/gene/show/14191),
GAD2 (http://mouse.brain-map.org/experiment/show/79591669) and
vGlut2 (http://mouse.brain-map.org/experiment/show/73818754)
in the PVT (©2014 Allen Institute for Brain Science).
b, c, Superfusion of a GABA(A) receptor antagonist (picrotoxin, 100 μM)
selectively blocked IPSCs (b) (n = 4), whereas superfusion of an AMPAR
antagonist (CNQX, 10 μM) blocked both IPSCs and EPSCs (c) (n = 5).
Scale bar, 400 pA, 30 ms. d, Example trace (left) and quantification
(right) of the onset latency for photo-evoked IPSCs and EPSCs in the
same neuron (n = 20). Scale bar, 400 pA, 10 ms. Wilcoxon signed-rank test.
***P < 0.001. e, Averaged amplitude of photo-evoked IPSCs in
D1-MSNs (n = 11) and D2-MSNs (n = 14). Mann-Whitney U-test,
P = 0.67. Mean ± s.e.m.
Extended Data Figure 4 | Location of ChR2, ArchT and hM4Di
expression in the PVT. a, Schematics showing the PVT and its
surrounding brain nuclei from anterior to posterior locations. 3V, third
ventricle; MD, mediodorsal thalamus; CM, central middle thalamus;
DG, dentate gyrus; Hb, habenula; IMD, intermediodorsal thalamus.
b, Overlay of ChR2-YFP expression in 21 mice, including 13 mice in
Fig. 1d and 8 mice in Fig. 4g, h. c, Overlay of ArchT-YFP expression in
18 mice, including 10 mice in Fig. 2d and 8 mice in Extended Data Fig. 7c.
d, Overlay of hM4Di-mCherry expression in 16 mice, including 8 mice
in Fig. 2g and 8 mice in Extended Data Fig. 7d. Intensity of red colour is
proportional to the number of mice expressing virus in the marked area.
Extended Data Figure 5 | Dose-response analysis of naloxone-precipitated
withdrawal symptoms. Concentration of naloxone was
determined by 2 × 2 factorial design. Different doses of naloxone
(0, 1, 5, 10 mg per kg body weight) were injected in chronic saline-
(blue) or morphine- (red) treated mice. Behavioural signs of withdrawal
including jump (a), rearing (b) and tremor (c) were recorded for 20 min
immediately after naloxone injection. CPA (d) tests were performed 24 h
after withdrawal. One-way ANOVA followed by Tukey’s test. Jumping:
morphine + naloxone group F(3,36) = 9.93, P < 0.0001; rearing: saline
+ naloxone group F(3,36) = 7.07, P < 0.01; morphine + naloxone group
F(3,36) = 22.98, P < 0.0001; tremor: saline + naloxone group F(3,29) = 3.74,
P < 0.05; morphine + naloxone group F(3,36) = 40.48, P < 0.0001; CPA:
saline + naloxone group F(3,39) = 2.67, P = 0.066; morphine + naloxone
group F(3,36) = 9.93, P < 0.0001; P values for post-hoc Tukey’s test are
indicated on each comparison pair. Mean ± s.e.m.
Extended Data Figure 6 | Opiate withdrawal induced c-Fos expression
in the PVT^NAc projection neurons. a, Example confocal image shows
expression of c-Fos in a small number of PVT^NAc projection neurons after
injection of saline into chronic morphine-treated mice. Left: scale bar,
100 μm. Right: magnified image shows the boxed area; scale bar, 50 μm.
b, Quantification of CTB- (left) and c-Fos- (right) positive cells per square
millimetre after injection of saline (white bar, n = 5) or naloxone (grey bar,
n = 6) into chronic morphine-treated mice. c, Example confocal images
show that spontaneous withdrawal from morphine (bottom) but not saline
(top) treatment increases the expression of c-Fos in PVT^NAc projection
neurons. Left: scale bar, 100 μm. Right: magnified image shows the boxed
area; scale bar, 50 μm. d, Quantification of CTB- (left) and c-Fos- (middle)
positive cells per square millimetre and percentage of PVT^NAc projection
neurons (right) expressing c-Fos induced by spontaneous withdrawal from
morphine (grey bar, n = 4) or saline (white bar, n = 4). Mann-Whitney
U-test. *P < 0.05, **P < 0.01. Mean ± s.e.m.
Extended Data Figure 7 | The PVT→NAc pathway is required for
expression of behavioural aversion to footshock and LiCl injection.
a, Confocal images show robust expression of c-Fos (red) in the PVT^NAc
projection neurons (green) after footshock (top) and LiCl injection
(bottom). Left: scale bar, 100 μm. Right: magnified image shows the boxed
area; scale bar, 50 μm. b, Quantification of CTB- (left) and c-Fos- (middle)
positive cells per square millimetre and percentage of PVT^NAc projection
neurons (right) expressing c-Fos induced by footshock (white bar, n = 4)
or LiCl injection (grey bar, n = 4). c, Light stimulation suppressed the
expression of footshock-induced CPA in ArchT- (n = 8) but not eGFP-
(n = 8) expressing mice. d, Local infusion of CNO reduced the expression
of LiCl-induced CPA in hM4Di- (n = 8) but not eGFP- (n = 8) expressing
mice. Mann-Whitney U-test. *P < 0.05, **P < 0.01. Mean ± s.e.m.
Extended Data Figure 8 | Effects of chronic morphine treatment
and in vivo optogenetic LTD induction on the strength of AMPAR
and NMDAR current in MSNs receiving input from the PVT or
BLA. a, b, In the PVT→NAc pathway, chronic morphine treatment
specifically increases AMPAR but not NMDAR current on D2-MSNs
(saline/morphine, n = 13/16 cells) but not D1-MSNs (saline/morphine,
n = 14/14 cells). Two-way ANOVA (AMPAR: F(1,53) = 5.24, P < 0.05;
NMDAR: F(1,53) = 0.04, P = 0.83) followed by Tukey’s test. **P < 0.01.
c, d, In the BLA→NAc pathway, chronic morphine treatment increases
both AMPAR current (c) and NMDAR current (d) on D1-MSNs (saline/
morphine, n = 14/13 cells) but not D2-MSNs (saline/morphine, n = 13/12
cells). Two-way ANOVA (AMPAR: F(1,51) = 7.06, P < 0.05; NMDAR:
F(1,51) = 0.35, P = 0.55) followed by Tukey’s test. *P < 0.05. e, Chronic
morphine treatment has no effect on the AMPAR/NMDAR ratio in
either D1-MSNs or D2-MSNs in the BLA→NAc pathway. Two-way
ANOVA (F(1,51) = 7.06, P < 0.05). f, g, In the PVT→NAc pathway,
in vivo optogenetic stimulation (4 ms, 1 Hz, 900 pulses) in morphine-
treated mice specifically decreases AMPAR but not NMDAR current
in D2-MSNs (morphine/morphine + 1 Hz, n = 16/17 cells) but not
D1-MSNs (morphine/morphine + 1 Hz, n = 14/14 cells). Two-way
ANOVA (AMPAR: F(1,57) = 4.24, P = 0.04; NMDAR: F(1,57) = 0.01,
P = 0.92) followed by Tukey’s test. *P < 0.05. Mean ± s.e.m.


© 2016 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 9 | Chronic morphine treatment and in vivo
optogenetic LTD induction do not affect paired-pulse ratio of
MSNs receiving PVT input. a, b, Example traces (a) and quantification
(b) of paired-pulse ratio of photo-evoked EPSCs in D1- and D2-MSNs.
An escalating regimen of morphine treatment and in vivo optogenetic
stimulation (4 ms, 1 Hz, 900 pulses) in morphine-treated mice had no
obvious effect on the paired-pulse ratio of MSNs receiving PVT input
(two-way ANOVA, F(2,35) = 0.02, P = 0.97). D1-MSNs: saline/morphine/
morphine + 1 Hz, n = 9/7/6; D2-MSNs: saline/morphine/morphine + 1 Hz,
n = 7/6/6. Scale bars, 200 pA, 50 ms. Mean ± s.e.m.




Extended Data Table 1 | Statistical analysis for Figs 1-4 

Columns: figure panel; groups (Ns refer to animals in behavior and c-fos 
experiments and cells in electrophysiology experiments); statistical analysis. 

1d (RTPP) 
Groups: eGFP control (n = 8); ChR2 + saline (n = 10); ChR2 + NBQX (n = 8); 
ChR2 + D1R antagonist (n = 8); ChR2 + D2R antagonist (n = 8) 
Statistical analysis: One-way ANOVA: F(4, 37) = 29.61, p < 0.0001. 
Post-hoc Tukey's test: ChR2 + saline vs. GFP control, p < 0.001; 
ChR2 + saline vs. ChR2 + NBQX, p < 0.001; 
ChR2 + saline vs. ChR2 + D1R antagonist, p = 0.98; 
ChR2 + saline vs. ChR2 + D2R antagonist, p = 0.58 

Groups: Morphine + saline (n = 5); Morphine + naloxone (n = 6) 
Statistical analysis: Mann-Whitney U-test. 
Morphine + saline vs. Morphine + naloxone, p < 0.01 

Groups: eGFP control (n = 9); ArchT silencing (n = 10) 
Statistical analysis: Mann-Whitney U-test. 
Jump: GFP vs. Arch, p < 0.001; Rearing: GFP vs. Arch, p < 0.01; 
Tremor: GFP vs. Arch, p < 0.001 

Groups: eGFP control (n = 9); ArchT silencing (n = 10) 
Statistical analysis: Mann-Whitney U-test. 
1d CPA: GFP vs. Arch, p < 0.05; 7d CPA: GFP vs. Arch, p < 0.05 

Groups: Baseline (n = 5); CNO (n = 5) 
Statistical analysis: Wilcoxon signed-rank test. Baseline vs. CNO, p < 0.05 

Groups: eGFP + CNO (n = 8); hM4D + CNO (n = 8); hM4D + saline (n = 8) 
Statistical analysis: One-way ANOVA: F(2, 21) = 7.4, p < 0.01. 
Post-hoc Tukey's test: hM4D + CNO vs. GFP + CNO, p < 0.05; 
hM4D + CNO vs. hM4D + saline, p < 0.01; 
hM4D + saline vs. GFP + CNO, p = 0.55 

2h (Locomotion) 
Groups: eGFP control (n = 9); ArchT silencing (n = 9) 
Statistical analysis: Wilcoxon signed-rank test. 
Saline: GFP vs. Arch, p = 0.57; Morphine: GFP vs. Arch, p = 0.5 

3c (AN ratio) 
Groups: Saline D1 (n = 14); Saline D2 (n = 13); Morphine D1 (n = 14); 
Morphine D2 (n = 16) 
Statistical analysis: Two-way factorial ANOVA, Group (saline, morphine) × Cell (D1, D2). 
Group: F(1,53) = 17.01, p < 0.001; Cell: F(1,53) = 9.18, p < 0.01; 
Group × Cell: F(1,53) = 12.58, p < 0.001. 
Post-hoc Tukey's test: saline D1 vs morphine D1, p = 0.97; 
saline D2 vs morphine D2, p < 0.001 

3e (Rectification index) 
Groups: Saline D1 (n = 8); Saline D2 (n = 7); Morphine D1 (n = 9); 
Morphine D2 (n = 19) 
Statistical analysis: Two-way factorial ANOVA, Group (saline, morphine) × Cell (D1, D2). 
Group: F(1,30) = 6.65, p < 0.05; Cell: F(1,30) = 4.32, p < 0.05; 
Group × Cell: F(1,30) = 9.87, p < 0.01. 
Post-hoc Tukey's test: saline D1 vs morphine D1, p = 0.97; 
saline D2 vs morphine D2, p < 0.01 

4c (AN ratio) 
Groups: Morphine D1 (n = 14); Morphine D2 (n = 16); Morphine + 1Hz, D1 (n = 14); 
Morphine + 1Hz, D2 (n = 17) 
Statistical analysis: Two-way factorial ANOVA, Group (morphine, morphine + 1Hz) × Cell (D1, D2). 
Group: F(1,57) = 21.59, p < 0.0001; Cell: F(1,57) = 11.57, p < 0.01; 
Group × Cell: F(1,57) = 13.86, p < 0.001. 
Post-hoc Tukey's test: Morphine D1 vs morphine 1Hz D1, p = 0.97; 
Morphine D2 vs morphine 1Hz D2, p < 0.001 

4e (Rectification index) 
Groups: morphine D1 (n = 9); morphine D2 (n = 10); depotentiation D1 (n = 7); 
depotentiation D2 (n = 10) 
Statistical analysis: Two-way factorial ANOVA, Group (morphine, morphine 1Hz) × Cell (D1, D2). 
Group: F(1,32) = 5.12, p < 0.05; Cell: F(1,32) = 8.23, p < 0.01; 
Group × Cell: F(1,32) = 4.3, p < 0.05. 
Post-hoc Tukey's test: saline D1 vs morphine D1, p = 0.92; 
saline D2 vs morphine D2, p < 0.05 

4g (Behavioral signs) 
Groups: eGFP control (n = 8); ChR2 depotentiation (n = 8) 
Statistical analysis: Mann-Whitney U-test. 
Jump: GFP vs. ChR2, p < 0.01; Rearing: GFP vs. ChR2, p < 0.01; 
Tremor: GFP vs. ChR2, p < 0.01 

Groups: eGFP control (n = 8); ChR2 depotentiation (n = 8) 
Statistical analysis: Mann-Whitney U-test. 
1d CPA: GFP vs. ChR2, p < 0.05; 7d CPA: GFP vs. ChR2, p < 0.01 
Hoxb5 marks long-term haematopoietic stem cells 
and reveals a homogenous perivascular niche 


James Y. Chen, Masanori Miyanishi, Sean K. Wang, Satoshi Yamazaki, Rahul Sinha, Kevin S. Kao, Jun Seita, 
Debashis Sahoo, Hiromitsu Nakauchi & Irving L. Weissman 


Haematopoietic stem cells (HSCs) are arguably the most extensively 
characterized tissue stem cells. Since the identification of HSCs by 
prospective isolation’, complex multi-parameter flow cytometric 
isolation of phenotypic subsets has facilitated studies on many 
aspects of HSC biology, including self-renewal”~*, differentiation, 
ageing, niche”, and diversity® *. Here we demonstrate by unbiased 
multi-step screening, identification of a single gene, homeobox 
B5 (Hoxb5, also known as Hox-2.1), with expression in the bone 
marrow that is limited to long-term (LT)-HSCs in mice. Using a 
mouse single-colour tri-mCherry reporter driven by endogenous 
Hoxb5 regulation, we show that only the Hoxb5* HSCs exhibit 
long-term reconstitution capacity after transplantation in primary 
transplant recipients and, notably, in secondary recipients. Only 
7-35% of various previously defined immunophenotypic HSCs 
are LT-HSCs. Finally, by in situ imaging of mouse bone marrow, 
we show that >94% of LT-HSCs (Hoxb5*) are directly attached to 
VE-cadherin* cells, implicating the perivascular space as a near- 
homogenous location of LT-HSCs. 

Prospective isolation of HSCs requires that the isolated cells are capa- 
ble of long-term production of all blood cell types in primary irradiated 
hosts, as well as self-renewal, such that the cells can be transplanted to 
secondary hosts to give rise to long-term multilineage repopulation. 
From the first enrichment and isolation of candidate HSCs!*!°, this 
activity has been entirely contained in cell-surface-marker-defined cell 
populations, and more recently in fluorescent reporters'!~'?. However, 
the precise fraction of cells in those populations that are true LT-HSCs 
remains unknown. 

To enable further purification of LT-HSCs, we sought to identify 
genes expressed exclusively in HSCs within cell populations resident in 
mouse bone marrow, detectable by flow cytometry and in situ fluores- 
cence, and thus performed the following four-step screening (Fig. 1d). 

First, we compared microarray gene expression assays among 28 distinct 
populations of the haematopoietic system (Extended Data Fig. 1a 
and Supplementary Table 1). Using the Gene Expression Commons 
platform', we identified 118 candidate HSC-specific genes (Fig. 1a and 
Supplementary Table 2). Surprisingly, this list did not include all previously 
reported HSC-specific markers'!"!? (Extended Data Fig. 1b and 
Supplementary Table 2). Second, to identify HSCs in situ, we excluded 
candidates that also label non-haematopoietic cells present in the 
bone marrow such as stromal and endothelial cells!!°. Consequently, 
we excluded genes expressed in eight distinct non-haematopoietic 
bone marrow populations, thereby narrowing the list to 45 candidate 
genes (Fig. 1a). 

Next, to ensure that the expression of any candidates could be 
detected by both flow cytometry and in situ fluorescence, we used 
RNA-sequencing (RNA-seq) combined with a threshold gene standard to 
estimate the fragments per kilobase of transcript per million mapped reads 
(FPKM) value that could serve as a detection threshold. From the bone 
marrow of 12-week-old mice, we sorted and RNA-sequenced 
immunophenotypically defined (Lin⁻c-Kit⁺Sca-1⁺CD150⁺CD34⁻/ˡᵒFlk2⁻) 
HSCs (hereafter referred to as pHSCs), multipotent progenitors subset 
A (MPPa; Lin⁻c-Kit⁺Sca-1⁺CD150⁺CD34⁺Flk2⁻), and multipotent 
progenitors subset B (MPPb; Lin⁻c-Kit⁺Sca-1⁺CD150⁻CD34⁺Flk2⁻) 
(Fig. 1b) to determine the FPKM value of candidate genes. On the 
basis of Bmi-1-eGFP knock-in reporter expression!’, we found that a 
single copy of eGFP is detectable at an estimated FPKM value of ~20. 
However, this high threshold would have excluded all candidate genes. 
Therefore, we designed a targeting construct (Fig. 1e) with three copies 
of mCherry, bringing the theoretical detection limit to ~7 FPKM. 
Lastly, to minimize aberrant detection, we set threshold FPKM values 
for both the MPPa and MPPb fractions to 2.5. Only three genes, Hoxb5, 
Rnf208, and Smtnl1, met these criteria (Fig. 1b). 
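
The expression cut-offs in this step amount to a simple filter, which can be sketched in Python as follows. This is an illustrative sketch only: the `GeneFpkm` record, the `passes_screen` helper, the placeholder gene names and all FPKM values are hypothetical, and treating both cut-offs as strict inequalities is an assumption rather than something stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneFpkm:
    """Mean FPKM of one candidate gene in the three sorted fractions."""
    gene: str
    hsc: float   # pHSC fraction
    mppa: float  # MPPa fraction
    mppb: float  # MPPb fraction


def passes_screen(g: GeneFpkm, detect_min: float = 7.0,
                  progenitor_max: float = 2.5) -> bool:
    """Keep genes detectable in pHSCs but near-silent in MPPa and MPPb.

    detect_min mirrors the ~7 FPKM limit of the triple-mCherry design;
    progenitor_max mirrors the 2.5 FPKM limit set for both MPP fractions.
    """
    return (g.hsc > detect_min
            and g.mppa < progenitor_max
            and g.mppb < progenitor_max)


# Hypothetical values, for illustration only (not data from the study).
candidates = [
    GeneFpkm("GeneA", hsc=12.0, mppa=0.8, mppb=0.3),  # passes
    GeneFpkm("GeneB", hsc=5.1, mppa=0.2, mppb=0.1),   # below detection limit
    GeneFpkm("GeneC", hsc=30.0, mppa=6.4, mppb=1.0),  # too high in MPPa
]

hits = [g.gene for g in candidates if passes_screen(g)]
print(hits)
```

Raising `detect_min` to the single-copy eGFP estimate of about 20 FPKM shows why the authors moved to three mCherry copies: in their data, that stricter limit would have excluded every candidate.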

Given previous reports of heterogeneity within pHSCs, we analysed 
single cells to determine whether the remaining candidate genes 
were heterogeneously expressed among pHSCs. We reasoned that an 
ideal pan-HSC candidate gene would label the majority of pHSCs, with 
quantitative differences potentially reflecting HSC heterogeneity/diversity. 
We thus performed single-cell quantitative PCR (qPCR) analysis of 
pHSCs, and evaluated expression of Hoxb5, Rnf208, and Smtnl1. Only 
Hoxb5 exhibited bimodal expression, in comparison to the unimodality 
of Rnf208 and Smtnl1 (Fig. 1c). Therefore, from the entire HSC 
transcriptome, only Hoxb5 satisfied the criteria of our extensive unbiased 
screening (Fig. 1d). 

We next sought to generate a Hoxb5 reporter with minimal disruption 
of endogenous Hoxb5 function. Thus we designed our targeting 
construct and CRISPR guide sequences to facilitate an in-frame 
knock-in to the endogenous Hoxb5 gene locus immediately 5′ of the 
only endogenous stop codon. We used three tandem mCherry cassettes 
separated by porcine teschovirus-1 2A (P2A) sequences, with the terminal 
mCherry carrying a CAAX membrane localization sequence 
(Hoxb5-tri-mCherry) (Fig. 1e). 

To evaluate the specificity of this reporter, we isolated whole bone 
marrow cells from 12-week-old reporter mice and measured mCherry⁺ 
cells in the following immunophenotypic populations: pHSC, MPPa, 
MPPb, Flk2⁺ multipotent progenitor, megakaryocyte erythrocyte 
progenitor, granulocyte monocyte progenitor, common myeloid progenitor, 
common lymphoid progenitor fractions; differentiated cell 
populations (B cell, T cell, natural killer (NK) cell, neutrophil, eosinophil, 
monocyte, macrophage, dendritic cell, red blood cell, and megakaryocyte); 
and in CD45⁻ stromal fractions (Fig. 1f, Extended Data 
Fig. 2a, b, Extended Data Fig. 3, and unpublished data). Consistent 
with our initial screen (Fig. 1a-d), and using wild-type mice as a 
fluorescence minus one (FMO) threshold?!, mCherry-labelled cells 
were highly enriched in the pHSC fraction (21.8% ± 0.90%), had a low 
frequency in the MPPa fraction (1.88% ± 0.64%), and background 
frequencies in the remaining fractions (Fig. 1f, Extended Data Figs 2b and 3 
and unpublished data). Interestingly, as only a minority of pHSCs were 
mCherry⁺, this suggested that either our reporter labelled only a 
subfraction of HSCs or only a subfraction of pHSCs were indeed HSCs. 

To distinguish between these two possibilities and to determine 
whether Hoxb5 is a reporter of LT-HSCs, we characterized 
Hoxb5-expressing cells by transplantation. In order to be inclusive of all 
events in the pHSC gate, we used wild-type FMO to define Hoxb5 
negativity (Hoxb5ⁿᵉᵍ or sometimes referred to as Hoxb5⁻) and divided 
the positive fraction (sometimes referred to as Hoxb5⁺) into Hoxb5ʰⁱ 
(top 5th percentile) and Hoxb5ˡᵒ (14.1 ± 7.7%) (Extended Data Fig. 4). 
Ten-cell and three-cell grafts of Hoxb5ʰⁱ, Hoxb5ˡᵒ, or Hoxb5ⁿᵉᵍ pHSCs 
were transplanted with supporting bone marrow cells into irradiated 
mouse recipients. We used CD45.2 expression to assess donor HSC 
contribution to haematopoietic lineages at 4-week intervals (Fig. 2a). 
Analysis of peripheral blood 16 weeks after transplantation of ten 
cells demonstrated that multilineage reconstitution was present in 
78% of Hoxb5ʰⁱ, 70% of Hoxb5ˡᵒ, and 44% of Hoxb5ⁿᵉᵍ pHSC recipients 
(Fig. 2b and Extended Data Fig. 5a). Three-cell transplants exhibited 
similar kinetics (Extended Data Fig. 5c). Notably, the chimaerism of 
the Hoxb5ⁿᵉᵍ pHSCs decreased over time, in particular between 4 and 
8 weeks, and shifted towards a predominantly lymphoid chimaerism 
(Fig. 2b, d and Extended Data Fig. 5a), suggesting that either the 
Hoxb5ⁿᵉᵍ fraction comprised lymphoid-biased HSCs or, more likely, 
transiently self-renewing short-term (ST)-HSCs and/or MPPs that had 
given rise to long-lived lymphocytes. 

Figure 1 | Multi-step unbiased screening identifies Hoxb5 as a candidate 
LT-HSC marker. a, Microarray heat map depicting relative expression 
(pink, high; blue, low) of HSC-specific genes in haematopoietic and 
stromal populations. Each row represents a gene; each subcolumn a 
replicate microarray; each labelled column a category of cell populations. 
The 45 genes in the top panel displayed limited activity in all non-HSC 
populations examined. b, Transcriptional profiling by RNA-seq of the 45 
genes from a. Three genes (top panel) exceeded the estimated threshold 
for detection (FPKM > 7.0) in HSCs while showing minimal expression 
expression for the three remaining candidate genes in HSCs as assessed 
by single-cell qPCR. d, Venn diagram reflecting the four-step screen 
that identified Hoxb5 as an ideal candidate in the HSC transcriptome. 
e, Targeting strategy to generate a triple-mCherry Hoxb5 knock-in mouse 
reporter line (Hoxb5-tri-mCherry). UTR, untranslated region; PGK, 
phosphoglycerate kinase I. f, Hoxb5 reporter expression (red) in pHSCs 
and MPPs compared to wild-type controls (blue). Values indicate the 
percentage of mCherry⁺ cells ± s.d. in each fraction for n = 3 mice. 

To evaluate these two possibilities, we carried out a secondary 
transplantation of whole bone marrow from primary Hoxb5ʰⁱ, Hoxb5ˡᵒ, or 
Hoxb5ⁿᵉᵍ pHSC recipients into lethally irradiated secondary recipients 
(Fig. 2a). Sixteen weeks after secondary transplantation, peripheral blood 
analyses revealed robust multilineage chimaerism from all Hoxb5ʰⁱ 
and Hoxb5ˡᵒ transplant recipients, with minimal chimaerism from the 
Hoxb5ⁿᵉᵍ fraction (Fig. 2c, e and Extended Data Fig. 5b). Furthermore, 
bone marrow analysis of primary recipients for donor pHSCs revealed 
that the Hoxb5ʰⁱ pHSC recipients contained Hoxb5ʰⁱ, Hoxb5ˡᵒ, and 
Hoxb5ⁿᵉᵍ cells (100%, n = 10 mice), whereas the recipients of Hoxb5ⁿᵉᵍ 
cells were only Hoxb5ⁿᵉᵍ (36%, n = 4 mice) or devoid of donor cells (64%, 
n = 7 mice). These results further suggested that the Hoxb5ⁿᵉᵍ pHSCs 
were in fact transiently self-renewing ST-HSCs/MPPs. To determine if 
Hoxb5 expression could also distinguish LT-HSCs in a primary transplant, 
we normalized the number of donor cells used for secondary 
transplant by sorting 100 Lin⁻c-Kit⁺Sca-1⁺ (LSK) donor cells from the 
primary Hoxb5" recipients and transplanted them into irradiated secondary 
recipients (n = 24 mice) (Fig. 2a). As with the primary bone marrow 
transplantations, the chimaerism was minimal in Hoxb5⁻ compared to 
Hoxb5⁺ recipients (Extended Data Fig. 5d). Limiting dilution analysis 
revealed that the frequency of long-term plus short-term HSCs in primary 
hosts at 16 weeks was 1 in 2.1 for Hoxb5ʰⁱ, 1 in 2.4 for Hoxb5ˡᵒ, and 1 in 
16.1 for Hoxb5ⁿᵉᵍ recipients (Extended Data Fig. 6). Taken together, these 
results demonstrate that Hoxb5 labels functional LT-HSCs. 

Given that Hoxb5⁻ cells are non-LT-HSCs, we re-examined the 
specificity of past definitions (Fig. 3a, f), including previously reported 
refinements to the LT-HSC immunophenotype!*” and the most 
widely used in situ definition over the past decade”. We found that 
78.5% ± 2.6% of CD11a⁻ HSCs!°, 63.9% ± 3% of the HSC-1 
(Lin⁻c-Kit⁺Sca-1⁺CD48⁻CD150⁺CD229⁻CD244⁻)”°, and 82.5% ± 0.4% of 
fraction I HSCs (Lin⁻c-Kit⁺Sca-1⁺CD34⁻/ˡᵒCD150⁺CD41⁻)!8 were 
Hoxb5⁻ (Fig. 3b, c, and d, respectively). Surprisingly, 91.3% ± 0.4% 
of Lin⁻CD48⁻CD41⁻CD150⁺ (ref. 22) cells were Hoxb5⁻ (Fig. 3e). 
As this subset was initially used for localization of HSCs in situ’?, 
we re-examined the in situ location of HSCs using Hoxb5 expression. 

Visualizing LT-HSCs in bone marrow and identifying the cellular 
constituents and structures of the HSC niche remains challenging. 
Despite this, multiple constituent cell types have been proposed, includ- 
ing mesenchymal stromata, endosteal osteoblasts, glia, endothelia, 
and pericytes’. In situ studies are made difficult by several technical 
limitations, including the number of fluorescent colours, difficulty in 
identifying HSCs surrounded by non-HSCs, and difficulty in trans- 
lating the same fluorescently defined positive and negative thresholds 
from flow cytometry to tissue sections. 

To address these issues, we used the Hoxb5-tri-mCherry reporter. 
Using flow cytometry, we determined the utility of Hoxb5 alone in 
identifying LT-HSCs. After logical exclusion of autofluorescence by 
comparison to wild-type control mice, we found that all Hoxb5⁺ events 
are within the c-Kit⁺ fraction and 62.0 ± 12.8% of all Hoxb5⁺ events 
are located in the pHSC gate (Extended Data Fig. 8), representing an 
eight- to nine-fold enrichment compared to previous in situ labelling 
of HSCs (Lin⁻CD48⁻CD41⁻CD150⁺)²² (Fig. 3e). 

To reveal the three-dimensional (3D) HSC niche architecture, we 
applied the CUBIC technique (clear, unobstructed brain imaging cocktails 
and computational analysis”) to tibial bone marrow, facilitating 
depletion of autofluorescent cellular components (Fig. 4a). Given 
that fluorescence intensity is preserved by CUBIC”, the specificity of 
LT-HSCs in situ correlates with Hoxb5 intensity determined by flow 
cytometry. Therefore, on the basis of the number of Hoxb5⁺ LT-HSCs 
detected in one tibia (Extended Data Fig. 9a, b), we objectively gated 
the same number of Hoxb5⁺ cells by intensity as in situ Hoxb5⁺ cells. 
We observed a uniform distribution of in situ Hoxb5⁺ cells along the 
longitudinal axis of the tibia (Fig. 4e and Extended Data Fig. 10a-c). 
The average percentage of in situ Hoxb5⁺ cells was 30.8 ± 5.9 in the 
proximal epiphysis, 39.0 ± 2.4 in the diaphysis, and 30.1 ± 5.4 in the 
distal epiphysis per field (1.25 × 1.25 × 0.4 mm³) (Fig. 4e). 

Figure 3 | Previously defined HSC immunophenotypes contain 
Hoxb5⁻ cells. Flow cytometry plots of bone marrow from 12-week-old 
mice depicting Hoxb5-tri-mCherry reporter activity in previously 
reported HSC immunophenotypes. a, pHSC (LSK CD150⁺CD34⁻/ˡᵒFlk2⁻) 
Hoxb5⁺ (red) and Hoxb5⁻ (blue). b, CD11a⁻ (LSK CD150⁺CD34⁻/ˡᵒ 
CD11a⁻)!°. c, HSC-1 (LSK CD150⁺CD48⁻/ˡᵒCD229⁻/ˡᵒCD244⁻)°. 
d, Fraction I (LSK CD150⁺CD34⁻/ˡᵒCD41⁻)!8. e, CD150⁺CD48⁻CD41⁻ 
cells, currently used to identify HSCs in situ²². Wild-type FMO used to 
define Hoxb5 negativity for each panel. f, Summary percentage of Hoxb5⁺ 
and Hoxb5⁻ cells in characterized HSC subfractions. Error bars 
denote s.d.; n = 5 mice (see Extended Data Fig. 7a-d for gating scheme). 

Figure 4 | LT-HSCs exhibit near-homogenous attachment to 
VE-cadherin⁺ cells. a, Tissue preparation and representative images 
of tibial bone marrow plug after paraformaldehyde fixation (day 0) and 
treatment with reagent-1 (see ref. 23) (day 7, day 14). b, Localization 
of Hoxb5⁺ cells (red and arrows) and VE-cadherin⁺ cells (green) 
in 3D-reconstructed images. Scale bar, 30 µm. c, Representative 2D 
images of direct (top panel) and non-direct (bottom panel) association of 

To investigate the association of LT-HSCs with vasculature, we 
injected anti-mouse VE-cadherin antibody into reporter mice and 
analysed the bone marrow with CUBIC”. We found that 94.1 ± 1.9% 
of in situ Hoxb5⁺ cells were directly attached to the abluminal surface 
of VE-cadherin⁺ endothelial cells. In contrast, only 52.8 ± 2.3% of the 
random (Hoxb5⁻) spots were directly associated with VE-cadherin⁺ 
cells. This implied a near-homogenous perivascular location for the 
LT-HSC niche (Fig. 4b-d). 

Following the first successful enrichment of HSCs in 1988 (ref. 1), 
many groups have attempted to identify surface markers to isolate 
LT-HSCs (refs 18-20, 22, 24, 25). Identification of CD150, CD34, and CD48 enabled 
isolation of LT/ST-HSCs from MPPs. However, complete separation of 
LT-HSCs from ST-HSCs has never been fully accomplished. Our study 
demonstrates that Hoxb5 expression is specifically limited to LT-HSCs 
in adult mouse bone marrow. Limiting dilution assay shows that at 
least 1 in 2.1 of Hoxb5ʰⁱ cells are LT-HSCs. However, this assay 
underestimates the functional potential of candidate HSCs. Transplantation, 
although the gold standard for prospective isolation of HSCs, may not 
take into account differences between sessile and mobile HSCs”°?’, 
cell cycle status*®”°, expression of CD47 (an anti-phagocytic molecule 
expressed highly on mobilized HSCs but at low levels on sessile 
bone marrow HSCs”), irradiation, and their influence on engraftment 
efficiency. 

Our results demonstrate that the LT-HSC compartment is 
near-homogenously perivascular. Although other compartments have been 
implicated’, including less homogenous association of candidate HSCs 
with the vasculature!', it is likely that our results differ owing to the 
fraction of assayed populations that are LT-HSCs (Extended Data 
Fig. 1b and Supplementary Table 3). These results suggest that the 
model of Hoxb5-tri-mCherry is the most specific available, to our 
knowledge, for identification of LT-HSCs, and could facilitate in situ 
lineage tracing to define their role in haematopoiesis in the absence of 
transplantation into irradiated hosts. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Data Reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
outcome assessment. 

Mice. Eight-to-twelve-week-old C57BL/6J male mice (Jackson Laboratory) were 
used as wild-type controls. Eight-to-twelve-week-old male B6.SJL-Ptprcᵃ Pepcᵇ/BoyJ 
mice (Jackson Laboratory) were used as recipients for transplantation assay. 
Supporting cells for competitive reconstitution assays were collected from 
B6.SJL-Ptprcᵃ Pepcᵇ/BoyJ × C57BL/6J mice (F1 mice, CD45.1⁺/CD45.2⁺). Mice were bred at our 
animal facility according to NIH guidelines. All animal protocols were approved by 
the Stanford University Administrative Panel on Laboratory Animal Care. 
Hoxb5-tri-mCherry (C57BL/6J background) mice were used as donors for 
transplantation as well as for analysis. Please see Gene targeting for mouse derivation. 
Microarray data. All microarray data employed in this study are available at Gene 
Expression Commons (http://gexc.stanford.edu) and GEO GSE77078. Two to 
four microarray replicates were assessed for each distinct cell population!*"®. The 
immunophenotype definition of each fraction is included in Supplementary Table 1. 
RNA sequencing. RNA sequencing was performed as previously described?!. In brief, 
total RNA was isolated with TRIzol, treated with RQ1 RNase-free DNase (Promega) to 
remove minute quantities of genomic DNA if present, and cleaned up using RNeasy 
MinElute columns (QIAGEN). cDNA libraries were prepared for pHSC, MPPa, and 
MPPb populations using Ovation RNA-Seq System V2 (NuGen) and sequenced 
separately using HiSeq 2500 (Illumina) to obtain 2 × 150 base pair (bp) paired-end 
reads. Raw transcriptome sequence data were mapped to Mus musculus reference 
mRNAs using OLego” to produce a reference-guided transcript assembly. Four 
replicates were sequenced for each population. Data are accessible at NCBI SRP068593. 
Single-cell qPCR. Single cells for qPCR were processed using the Single Cell-to-CT 
qRT-PCR Kit (Life Technologies) as per the manufacturer's instructions. Cells were 
sorted directly into lysis solution in a 96-well plate, subjected to a reverse 
transcription reaction for cDNA synthesis, amplified for 14 cycles with pooled TaqMan Gene 
Expression Assays, and diluted with 1× TE buffer (pH 8.0). For real-time PCR, 
samples were amplified for 50 cycles of 3 s at 95 °C followed by 30 s at 60 °C using the 
following TaqMan probes: Mm00657672_m1 for Hoxb5; Mm03039759_s1 for Rnf208; 
Mm00470338_m1 for Smtnl1; and Mm99999915_g1 for Gapdh. All thermocycling 
was performed on a 7900HT Fast Real-Time PCR System (Applied Biosystems). 
Presence of a single cell was validated by a Ct value of 30 or less for Gapdh. 

Gene targeting. The targeting construct was cloned into pJYC (derived from 
pUC19, golden gate insert of TALE backbone vector*’, and removal of LacZ 
sequence). Right and left homology arms of ~700 bp were cloned by PCR 
from C57BL/6J genomic DNA, and the entire construct was sequence 
validated. BRUCE-4 embryonic stem (ES) cells (Millipore) derived from C57BL/6J 
mice were transfected with the targeting construct as well as bicistronic 
CRISPR vector PX330 expressing Cas9 and a Hoxb5-specific single guide RNA 
(5′-GGCUCCUCCGGAUGGGCUCA-3′)™. Following homologous recombination, 
recombinant clones were positively selected for one week with neomycin 
(100 µg ml⁻¹), transfected with an EF1α-Cre-puromycin vector, and selected for 
a second week with puromycin (1 µg ml⁻¹). Serially diluted ES colonies were then 
individually picked, expanded, and screened by PCR and qPCR for site-specific 
integration, exclusion of off-target effects, and correct copy number. After sequence 
validation of the targeted site, successfully targeted ES clones were used to establish 
chimaeras. Chimaeric mice were crossed with C57BL/6-Tyrᶜ⁻²ᴶ/J female mice to 
establish germline transmission. 

Flow cytometry and cell sorting. Flow cytometry and cell sorting were performed 
on a FACS Aria II cell sorter (BD Biosciences) and analysed using FlowJo software 
(Tree Star). Bone marrow cells were collected from bilateral tibias, femurs, humeri, 
and pelvises by crushing (unless otherwise specified) using a mortar and pestle in 
Ca²⁺- and Mg²⁺-free PBS supplemented with 2% heat-inactivated bovine serum 
(Gibco) and 2 mM EDTA. Cells were passed through 100 µm, 70 µm, and 40 µm 
strainers before analysis and sorting. To enrich HSCs and progenitor populations, 
cells were stained with APC-conjugated anti-c-Kit (2B8) and fractionated using 
anti-APC magnetic beads and LS columns (both Miltenyi Biotec). c-Kit⁺ cells 
were then stained with combinations of antibodies against the following surface 
markers: Sca-1, Flk2, CD150, CD34, IL-7R, CD16/32, and the lineage markers 
Ter-119, B220, CD2, CD3, CD4, CD5, CD8a, Gr-1, CD11a, CD11b, CD41, CD48, 
CD229, and CD244. For lymphoid populations, bone marrow cells were stained 
with antibodies against CD3, CD4, CD8a, CD11b, CD11c, Gr-1, NK-1.1, Ter-119, 
B220, c-Kit, and F4/80. For myeloid populations, cells were stained with antibodies 
against CD3, CD3-, CD11b, CD11c, CD19, Gr-1, NK-1.1, Ter-119, and F4/80. 
Antibody staining was performed at 4 °C and cells were incubated for 30 min. 
Cells stained with CD34 were incubated for 90 min. Before sorting or analysis, 
cells were stained with SYTOX Red Dead Cell Stain (Life Technologies) to assess 
viability as per the manufacturer's recommendations. Transplanted cells were 
double-sorted for purity. Further details regarding flow cytometry reagents are 
provided in Supplementary Table 4. 


Transplantations and peripheral blood analyses. B6.SJL-Ptprcᵃ Pepcᵇ/BoyJ 
(Jackson Laboratory) recipient mice were lethally irradiated at a single dose of 
9.1 Gy. For reconstitution assays, donor cells were first combined with 2 x 10° 
whole bone marrow supporting cells (B6.SJL-Ptprcᵃ Pepcᵇ/BoyJ × C57BL/6J F1 
mice, CD45.1⁺/CD45.2⁺) in 200 µl of PBS with 2% FBS, then injected into the 
retro-orbital venous plexus. Peripheral blood analyses were performed at 4, 8, 12, 
and 16 weeks after primary and secondary transplantations. At each time point, 
50 µl of blood was collected from the tail vein and added to 100 µl of PBS with 
2 mM EDTA. Red blood cells were subsequently lysed using BD Pharm Lyse Buffer 
(BD Pharmingen), as per the manufacturer's protocol, for 3 min on ice, followed by 
blocking with 5 µg ml⁻¹ rat IgG. Leukocytes were stained with antibodies against 
(refer to Supplementary Table 4 for specific clones and colours): CD45.1 (FITC), 
CD45.2 (PE), CD11b (BUV395), Gr-1 (Alexa Fluor 700), B220 (BV786), CD3 
(BV421), TCRβ (BV421), and NK-1.1 (PerCP-Cy5.5). For each mouse, the 
percentage of donor chimaerism in the peripheral blood was defined as the percentage 
of CD45.1⁻CD45.2⁺ cells among total CD45.1⁻CD45.2⁺ and CD45.1⁺CD45.2⁺ 
cells. To control for variability in host response to lethal irradiation, mice with host 
chimaerism of 50% or higher 16 weeks after transplantation were excluded from 
our analyses. The frequency of chimaerism in peripheral blood was analysed as 
follows. For evaluation of donor (CD45.1⁻CD45.2⁺) chimaerism kinetics, after 
exclusion of recipient (CD45.1⁺CD45.2⁻) fraction, the frequency of the donor 
fraction was calculated. Within the whole donor fraction, the frequency of each lineage 
(NK cell, B cell, T cell, granulocyte, and monocyte) was determined. For evaluation 
of lineage contribution kinetics, after gating each lineage, the frequency of the 
donor fraction (CD45.1⁻CD45.2⁺) was calculated. Any recipients that exhibited 
lower than 1% of chimaerism were treated as negative to exclude ambiguous cases. 
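
The chimaerism definition above reduces to a single ratio, sketched here in Python. The function names and the event counts are hypothetical and used only for illustration; the 1% cut-off is taken from the text, and treating exactly 1% as positive is an assumption.

```python
def donor_chimaerism(donor_events: int, supporting_events: int) -> float:
    """Percentage of donor (CD45.1-CD45.2+) events among donor plus
    supporting (CD45.1+CD45.2+) events; recipient events are excluded
    before this step."""
    total = donor_events + supporting_events
    if total == 0:
        raise ValueError("no donor or supporting events recorded")
    return 100.0 * donor_events / total


def is_positive(chimaerism_pct: float, cutoff_pct: float = 1.0) -> bool:
    """Recipients below the cut-off are treated as negative."""
    return chimaerism_pct >= cutoff_pct


# Hypothetical event counts from one peripheral blood sample.
pct = donor_chimaerism(donor_events=30, supporting_events=970)
print(pct, is_positive(pct))
```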
Limiting dilution analysis. The frequency of long-term and short-term HSCs was 
calculated using the transplantation data of ten- and three-cell Hoxb5ʰⁱ, Hoxb5ˡᵒ, 
or Hoxb5ⁿᵉᵍ pHSC transplants. Any mice showing long-term (>16 week) 
multilineage reconstitution (>1% in each lineage) were counted as positive recipients. 
A nonlinear regression semi-log best fit line was used to calculate the frequency 
of LT/ST-HSCs at F₀ = 0.368 (GraphPad Prism 6). 
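
The role of F₀ = 0.368 can be illustrated with a short Python sketch. It assumes the standard single-hit Poisson model, in which the fraction of non-reconstituted recipients at a dose of d cells is exp(-f·d), so that the fraction equals e⁻¹ ≈ 0.368 at d = 1/f. The fit below is a simple least-squares line through the origin on the log scale, used as a stand-in for the GraphPad Prism fit; the function name and the data are hypothetical.

```python
import math


def cells_per_hsc(doses, fraction_negative):
    """Estimate N in '1 in N cells is an HSC' under a single-hit Poisson model.

    Fits ln(fraction_negative) = -f * dose through the origin and returns
    1 / f, the dose at which the negative fraction is e**-1 (about 0.368).
    """
    if len(doses) != len(fraction_negative) or not doses:
        raise ValueError("doses and fractions must be non-empty and equal length")
    if any(not 0.0 < f <= 1.0 for f in fraction_negative):
        raise ValueError("fractions must lie in (0, 1]")
    num = sum(d * math.log(f) for d, f in zip(doses, fraction_negative))
    den = sum(d * d for d in doses)
    slope = num / den
    if slope >= 0.0:
        raise ValueError("no responders: frequency cannot be estimated")
    return -1.0 / slope


# Hypothetical negative fractions at the three- and ten-cell doses,
# generated from a true frequency of 1 in 5.
doses = [3, 10]
negatives = [math.exp(-3 / 5), math.exp(-10 / 5)]
print(round(cells_per_hsc(doses, negatives), 3))
```

Dedicated limiting-dilution tools usually fit this model by maximum likelihood and report confidence intervals; the sketch omits both for brevity.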

CUBIC bone marrow imaging. Bone clearing protocol was modified from the 
original CUBIC protocol”. Specifically, tibias were collected and fixed in 4% PFA 
solution for two days, after which bone marrow plugs were extracted from the distal 
end by flushing with a 25-gauge syringe. For nuclear staining, bone marrow 
plugs were immersed in DAPI/PBS solution at 37 °C for three days with gentle 
shaking. For clearing, marrow plugs were immersed in ScaleCUBIC-1 (Reagent-1) 
at 37 °C for two weeks with gentle shaking. The solution was changed every 48 h. To 
visualize vasculature, 20 µg of Alexa488-conjugated anti-mouse VE-cadherin 
antibody (BV13) was administered intravenously (retro-orbital) with tibias collected 
30 min later. Processed plugs were embedded in 4-mm-diameter glass capillaries 
with 2% agarose for imaging. Images were acquired using a Zeiss Z1 Lightsheet 
microscope (Zeiss) and reconstituted into 3D images using Zen software (Zeiss). 
Acquired 3D images were analysed with Imaris software (Bitplane). After exclusion 
of outliers including events of extraordinary size (>30 µm) or intensity, all other 
mCherry⁺ (Hoxb5⁺) cells were analysed in the tibia (n = 287 cells in total from 
n = 3 mice). The mCherry-negative threshold was determined on the basis of the 
intensity level of the wild-type control tibial plugs. These mCherry⁻ events with 
intensities ranging from 0.002-75.998 (75.998 representing the upper bound of 
mCherry intensity) were transformed into an integer list of 75,000 values (75,000 
representing the length of the list), and 600 spots were then randomly selected 
from this integer list using the 'randbetween' (1, 75,000) function in Excel 2015 
(Microsoft). Using the list of random spots identified by intensity, the location and 
distance to VE-cadherin⁺ cells was measured using the Imaris software. All CUBIC 
imaging experiments were performed in biological triplicates from three mice. 
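
The random selection of control spots described above can be reproduced outside a spreadsheet. The Python sketch below mimics 600 independent inclusive draws between 1 and 75,000, which is how a spreadsheet 'randbetween' call behaves; the function name and the seed parameter are hypothetical additions for reproducibility and are not part of the original procedure.

```python
import random


def pick_random_spots(n_spots=600, list_length=75_000, seed=None):
    """Draw n_spots integer positions in [1, list_length], both ends inclusive.

    Each draw is independent, so a position can occur more than once.
    """
    rng = random.Random(seed)
    return [rng.randint(1, list_length) for _ in range(n_spots)]


spots = pick_random_spots(seed=0)
print(len(spots), min(spots) >= 1, max(spots) <= 75_000)
```

If repeated positions are undesirable, `random.Random(seed).sample(range(1, list_length + 1), n_spots)` draws without replacement instead.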
Genotyping. Genomic DNA from Hoxb5-tri-mCherry mice was isolated 
from tail biopsies using QuickExtract solution (EpiCentre). PCR amplification 
was performed using the same forward primer (5′-GACGTATCGAGATCGCCCAC-3′) 
with two reverse primers to distinguish between the Hoxb5-tri-mCherry 
(5′-CCTTGGTCACCTTCAGCTTGG-3′) and wild-type 
(5′-AGATTGGAAGGGTCGAGCTG-3′) alleles. 

Statistics. All comparative analyses were performed using unpaired Student's 
t tests. Pearson's chi-square test was performed using online software 
(http://vassarstats.net/). 
Extended Data Figure 1 | GEXC expression of previously reported 
HSC markers in mouse bone marrow. a, Ideal expression pattern of 
HSC-specific genes (pink represents increased expression, blue 
represents decreased expression). b, Relative expression of Hoxb5 
(top left), α-catulin/Ctnnal1 (top middle), Fgd5 (top right), 
CD150/Slamf1 (bottom left), Hoxb4 (bottom middle), Gfi-1 (bottom right) 
in haematopoietic and stromal populations as determined by microarray 
analysis. 

Extended Data Figure 2 | Gating scheme for HSC and progenitors.
a, Representative flow cytometry gating to isolate pHSCs, MPPs, and
oligopotent progenitors from mouse bone marrow. Panels gated as shown
after exclusion of doublets and dead cells. b, Hoxb5 reporter expression
(red) in Flk2+ MPPs, megakaryocyte erythrocyte progenitor (MEP),
granulocyte monocyte progenitor (GMP), common myeloid progenitor
(CMP), and common lymphoid progenitor (CLP) populations compared
to wild-type controls (blue). Values indicate the percentage of mCherry+
cells ± s.d. in each fraction for n = 3 mice.

Extended Data Figure 3 | Hoxb5 is not expressed in CD45− bone
marrow. Hoxb5 reporter expression in the CD45− compartment within
bone marrow of wild-type (red) and three Hoxb5-tri-mCherry mice
(blue, orange, and green, n = 3 mice).

Extended Data Figure 4 | FMO gating for Hoxb5+ signal. Representative
flow cytometry gating to separate mCherry (Hoxb5)-high, -low, and -negative
populations in both wild-type and Hoxb5-tri-mCherry mice.

Extended Data Figure 5 | Hoxb5 distinguishes between LT-HSCs and
non-LT-HSCs. a, Reconstitution kinetics in primary recipients 4, 8, and
12 weeks after receiving ten Hoxb5^neg (n = 9 mice), Hoxb5^lo (n = 13 mice),
or Hoxb5^hi (n = 18 mice) pHSCs. Each column represents an individual
mouse. b, Reconstitution kinetics 4, 8, and 12 weeks after whole bone
marrow secondary transplant. c, Reconstitution kinetics in primary
recipients receiving three Hoxb5^neg (n = 11 mice), Hoxb5^lo (n = 12 mice),
or Hoxb5^hi (n = 14 mice) pHSCs. Each column represents an individual
mouse. d, Reconstitution kinetics following secondary transplant of 100
sorted LSK Hoxb5− (n = 14 mice) or Hoxb5+ (n = 9 mice) cells and 2 × 10⁵
supporting cells.

Extended Data Figure 6 | Limiting dilution analysis of Hoxb5+
and Hoxb5− pHSCs. Limiting dilution results of ten- and three-cell
transplants of Hoxb5^hi (red, n = 18 mice for ten-cell and n = 14 mice for
three-cell), Hoxb5^lo (green, n = 13 mice for ten-cell and n = 12 mice for
three-cell), and Hoxb5^neg (blue, n = 9 mice for ten-cell and n = 11 mice
for three-cell). Frequency of LT/ST-HSCs by limiting dilution analysis is 1
in 2.1 for Hoxb5^hi, 1 in 2.4 for Hoxb5^lo, and 1 in 16.1 for Hoxb5^neg cells.

Extended Data Figure 7 | Previously defined HSC immunophenotypes contain Hoxb5− cells. Representative HSC gating strategy for various HSC
definitions after exclusion of doublets and dead cells. a, CD11a− (LSK CD150+CD34−/loCD11a−)²¹. b, HSC-1 (LSK CD150+CD48−/loCD229−/loCD244−).
c, Fraction I (LSK CD150+CD34−/loCD41−)¹⁸. d, CD150+CD48−CD41− cells (n = 5 mice).

Extended Data Figure 8 | Specificity of Hoxb5 as a single marker for LT-HSCs. a, Flow cytometry plots of wild type (top row) and Hoxb5-tri-mCherry
(bottom row) after excluding doublets, dead cells, autofluorescence, and gating on Hoxb5+ events. Frequencies shown are percentage in gate ± s.d. in
each fraction (n = 3 mice).

Extended Data Figure 9 | Comparison of processing methods on pHSC
and Hoxb5+ LT-HSC yield. a, b, Relative frequency of pHSCs (a) and
Hoxb5+ LT-HSCs (b) in tibial plugs (flushed) (n = 6 mice) compared to
tibial plugs plus bones (crushed) (n = 6 mice).

Extended Data Figure 10 | Hoxb5+ HSCs are evenly distributed in the
tibia. a, Distribution of Hoxb5+ cells (red and arrows) in bone marrow
in 3D-reconstructed images. Nuclei are counterstained with DAPI (blue);
wild-type (top panel), Hoxb5-tri-mCherry (middle and bottom panels).
Scale bar, 100 μm. b, Cartoon representing the location of the proximal,
medial, and distal sampling. c, Representative 3D-reconstructed images
of Hoxb5+ cells (red) in proximal (left column), medial (middle column),
and distal (right column) regions of the tibia. Scale bar, 150 μm. Nuclei are
counterstained with DAPI (blue); n = 3 mice.

The Ebola virus disease epidemic in West Africa is the largest on
record, responsible for over 28,599 cases and more than 11,299
deaths¹. Genome sequencing in viral outbreaks is desirable to
characterize the infectious agent and determine its evolutionary rate. 
Genome sequencing also allows the identification of signatures of 
host adaptation, identification and monitoring of diagnostic targets, 
and characterization of responses to vaccines and treatments. The 
Ebola virus (EBOV) genome substitution rate in the Makona
strain has been estimated at between 0.87 × 10⁻³ and 1.42 × 10⁻³
mutations per site per year. This is equivalent to 16–27 mutations
per genome per year, meaning that sequences diverge rapidly enough
to identify distinct sub-lineages during a prolonged epidemic²⁻⁷.
Genome sequencing provides a high-resolution view of pathogen 
evolution and is increasingly sought after for outbreak surveillance. 
Sequence data may be used to guide control measures, but only if 
the results are generated quickly enough to inform interventions⁸.
Genomic surveillance during the epidemic has been sporadic
owing to a lack of local sequencing capacity coupled with practical
difficulties transporting samples to remote sequencing facilities”. 
To address this problem, here we devise a genomic surveillance 
system that utilizes a novel nanopore DNA sequencing instrument. 
In April 2015 this system was transported in standard airline 
luggage to Guinea and used for real-time genomic surveillance of 
the ongoing epidemic. We present sequence data and analysis of 
142 EBOV samples collected during the period March to October 
2015. We were able to generate results less than 24 h after receiving 
an Ebola-positive sample, with the sequencing process taking as 
little as 15-60 min. We show that real-time genomic surveillance is 
possible in resource-limited settings and can be established rapidly 
to monitor outbreaks. 
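
The conversion from a per-site substitution rate to a per-genome figure can be checked with a short calculation. This is a minimal sketch that assumes a genome length of 18,959 nucleotides (an approximate EBOV genome length that is not stated in the text); the function name is illustrative.

```python
# Assumed EBOV genome length in nucleotides (not given in the text).
GENOME_LENGTH = 18_959

def mutations_per_genome_per_year(rate_per_site_per_year,
                                  genome_length=GENOME_LENGTH):
    """Expected number of substitutions across one genome in one year."""
    return rate_per_site_per_year * genome_length

low = mutations_per_genome_per_year(0.87e-3)
high = mutations_per_genome_per_year(1.42e-3)
print(f"{low:.1f} to {high:.1f} mutations per genome per year")
```

Under this assumed length the two bounds round to 16 and 27, matching the range quoted above.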

Figure 1 | Deployment of the portable genome surveillance system in
Guinea. a, We were able to pack all instruments, reagents and disposable
consumables within aircraft baggage. b, We initially established the
genomic surveillance laboratory in Donka Hospital, Conakry, Guinea.
c, Later we moved the laboratory to a dedicated sequencing laboratory in
Coyah prefecture. d, Within this laboratory we separated the sequencing
instruments (on the left) from the PCR bench (to the right). An
uninterruptible power supply can be seen in the middle that provides
power to the thermocycler. (Photographs taken by J.Q. and S.D.)

Conventional sequencing technologies are difficult to deploy in
developing countries, where availability of continuous power and cold
chains, laboratory space, and trained personnel is restricted. In addition,
some genome sequencer instruments, such as those using optical
readings, for example the Illumina platform, require precise microscope
alignment and repeated calibration by trained engineers”!”. Recently, 
a new highly portable genome sequencer has become available. The 
MinION (Oxford Nanopore Technologies, Oxford, UK) weighs less 
than 100g. Data are read off the MinION from a laptop via a Universal 
Serial Bus (USB) port from which the instrument also draws power. 
The MinION works by taking frequent electrical current measurements 
as a single strand of DNA passes through a protein nanopore at 30 
bases per second. A DNA strand in the pore disrupts ionic flow, resulting
in detectable changes in current that depend on the nucleotide
sequence. Because the MinION detects single molecules it has a
much higher error rate (between 10-20%!) than high-throughput 
instruments that read clonal copies of DNA molecules. Single-molecule 
sequencing has the advantage of being able to read extremely long 
molecules of DNA (50kb or longer!*"). In order to generate accurate 
sequences, genomic regions must be read many times, with errors elim- 
inated through consensus averaging. This system has previously been 
used to investigate a bacterial outbreak, but not yet a viral outbreak"*. 
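
The effect of reading a region many times can be illustrated with a toy simulation. This is a deliberately simplified sketch, not the signal-level method used in this study: it assumes substitution-only errors at a 15% rate (real nanopore errors are dominated by insertions and deletions), equal-length reads that are already aligned, and a plain per-column majority vote. All names are illustrative.

```python
import random
from collections import Counter

BASES = "ACGT"

def noisy_read(reference, error_rate, rng):
    """Copy the reference, replacing each base with a different one at error_rate."""
    read = []
    for base in reference:
        if rng.random() < error_rate:
            read.append(rng.choice([b for b in BASES if b != base]))
        else:
            read.append(base)
    return "".join(read)

def majority_consensus(reads):
    """Call the most common base in each column of equal-length, aligned reads."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*reads))

rng = random.Random(1)
reference = "".join(rng.choice(BASES) for _ in range(200))
reads = [noisy_read(reference, error_rate=0.15, rng=rng) for _ in range(25)]
consensus = majority_consensus(reads)
mismatches = sum(a != b for a, b in zip(consensus, reference))
print(f"consensus mismatches at 25-fold coverage: {mismatches}")
```

Even though roughly one base in seven is wrong in any single simulated read, independent errors rarely agree at the same position, so the vote across 25 reads recovers the underlying sequence.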
We designed a laboratory protocol to permit EBOV genome sequenc- 
ing on the MinION that employed a targeted reverse transcriptase pol- 
ymerase chain reaction (RT-PCR) in order to isolate sufficient DNA 
for sequencing. We considered and rejected an alternative approach, 
that of total RNA sequencing, as this approach also amplifies human- 
derived transcripts and dilutes viral signal’. We designed a panel of 
38 primer pairs that would span the EBOV genome (Extended Data 
Fig. 1a, Supplementary Table 1). In pilot experiments at the Defence 
Science and Technology Laboratory (Dstl) Porton Down, UK, we 
sequenced a historic Zaire Ebolavirus using MinION as well as the 
Illumina MiSeq. Owing to difficulties in obtaining balanced amounts of each
of the 38 amplicons, only 65.7% of the EBOV genome was covered
by at least 25 reads, compared with 87.4% on Illumina. However, nucleotide
variants in those highly covered regions were concordant with
those obtained from Illumina sequencing, with the exception of a single
variant in a homopolymeric region. MinION sequencing currently cannot
easily resolve the length of homopolymers of 5 bases or greater¹⁶.

Figure 2 | Real-time genomics surveillance in the context of the Guinea
Ebola virus disease epidemic. a, Here we show the number of reported
cases of Ebola virus disease in Guinea (red) in relation to the number
of EBOV new patient samples (n = 137, in blue) generated during this
study. b, For each of the 142 sequenced samples, we show the relationship
between sample collection date (red) and the date of sequencing (blue).
Twenty-eight samples were sequenced within three days of the sample
being taken, and sixty-eight samples within a week. Larger gaps represent
retrospective sequencing of cases to provide additional epidemiological
context.

Next we designed a genome surveillance system that could be
transported to West Africa. The system consisted of three MinION
instruments (Oxford Nanopore Technologies, UK), four laptops,
a thermocycler, a heat block, pipettes and sufficient reagents and consumables
to perform sequencing (a full list of equipment is shown in
Extended Data Fig. 2). We were able to pack this into less than 50 kg
of standard airline travel luggage (Fig. 1a). We initially installed the
genome surveillance system in the European Mobile Laboratory in
Donka Hospital in Conakry, Guinea (Fig. 1b). Later on, the equipment
was moved to a dedicated laboratory, located within the Coyah Ebola
Treatment Unit (Fig. 1c, d).

We started sequencing genomes within 2 days of arriving in Guinea. 
We found early on that we were able to reliably generate long amplicons 
(around 2 kb in length) using primer pairs (Supplementary Table 4) in 
different combinations (Extended Data Fig. 1b, c). Using as few ampli- 
cons as possible significantly reduces effort when preparing samples. 
We found a combination of 11 amplicons that reliably amplified >97% 
of the EBOV genome. 

We developed a bioinformatics approach that would yield accurate 
genotypes, and validated this using Makona virus samples from a pre- 
vious study*. The bioinformatics workflow is detailed in the Methods 
and summarized in Extended Data Fig. 3. This validation process 
demonstrated that our bioinformatics analysis approach was robust. We 
compared our consensus sequences to those generated using Illumina 
sequencing and found that our approach was highly concordant, with
no false positive variant calls. In several cases, we were unable to determine
variants because they fell either within the primer binding region
or outside of the regions of the EBOV genome covered by
our amplicon set (Extended Data Fig. 4a). These positions are represented
as ambiguous nucleotides in the final consensus sequences used
for analysis. Despite these masked positions, phylogenetic inference
showed that samples clustered identically (Extended Data Fig. 4b).
We determined that, despite the instrument's high error rate, use of
electrical current information meant that 25-fold read coverage of
genome positions was sufficient to determine accurate genotypes
(Extended Data Fig. 5).

Figure 3 | Evolution of EBOV over the course of the Ebola virus disease
epidemic. a, Time-scaled phylogeny of 603 published sequences with
125 high quality sequences from this study. The shape of nodes on the
tree indicates country of origin. Our results show Guinean samples
(coloured circles) belong to two previously identified lineages, GN1 and
SL3. b, GN1 is deeply branching with early epidemic samples. c, SL3 is
related to cases identified in Sierra Leone. Samples are frequently clustered
by geography (indicated by colour of circle) and this provides information
as to origins of new introductions, such as in the Boké epidemic in May
2015. Map figure adapted from SimpleMaps website
(http://simplemaps.com/resources/svg-gn).




After deployment of the genome surveillance system, we worked in 
partnership with diagnostic laboratories in Guinea to provide real-time 
sequencing results to National Coordination in Guinea and the World
Health Organization (WHO). Collaborating laboratories provided leftover
diagnostic RNA extracts for sequencing. The genome sequencing
workflow, including amplification, sequence library preparation
and sequencing could be accomplished within a working day. In one 
case, including remote bioinformatics analysis, the fastest time from 
patient sample to answer was achieved in less than 24 h (Supplementary 
Table 1), although the protocol was more usually performed over two 
working days. We found that in half of cases, we were able to generate 
sufficient reads on the MinION (between around 5,000 and 10,000) 
in less than an hour (Extended Data Fig. 6). In total, 142 samples were 
sequenced over 148 MinION runs during the 6-month period, pro- 
viding extensive coverage of reported cases in the outbreak (Fig. 2). 
Full details of samples and runs are in the Supplementary Data. We 
failed to generate amplicons for some samples, resulting in missing 
regions of the genome. Such samples often corresponded to those with 
a high RT-PCR cycle threshold (Ct) value, suggestive of lower viral
loads (Extended Data Fig. 7). For these we used a modified RT-PCR 
scheme using 19 shorter amplicons. We assumed that difficulties gen- 
erating long amplicons related to low numbers of starting molecules 
of that length in the original sample. We excluded 17 samples owing 
to quality control issues, for example single nucleotide polymorphism 
(SNP) calling sensitivity of less than 75%. We found that in-field per- 
formance of the system was comparable with validation experiments 
performed in the UK, suggesting that the system tolerated transporta- 
tion well (Extended Data Fig. 8). 

We combined our sequencing data set with 603 samples from other 
studies and inferred a time-scaled phylogenetic tree using the BEAST 
software package (Fig. 3). A maximum likelihood analysis and root- 
to-tip analysis showed good agreement between sampling date and 
root-to-tip divergence (Extended Data Figs 9 and 10a). We estimated a 
substitution rate of 1.19 × 10⁻³ (95% interval, 1.09 × 10⁻³, 1.29 × 10⁻³)
for the combined data set (Extended Data Fig. 10b). This is consistent
with rates from previous studies*~’. Results generated within the first 
10 days of starting real-time sequencing indicated that the persisting 
Guinean cases belonged to two major lineages, named GN1 and SL3, 
that had been established near the beginning of the epidemic (Fig. 3). 
Lineage GN1 is deeply branching from early cases in Guinea and has 
been infrequently seen in Sierra Leone’, suggesting that it has been 
largely confined to Guinea. The second lineage identified here was 
derived from lineage SL3 which was first detected in Sierra Leone by 
Gire et al.’, but was later seen circulating in Conakry towards the end 
of 2014°. Through integration of our data set with those generated by 
a different group operating in Sierra Leone we detected that both GN1 
and SL3 had also been seen in Sierra Leone early in 2015, suggestive of 
transmission between the countries¹⁷.

This work demonstrates a step change in our ability to per- 
form genomic surveillance prospectively during outbreaks under 
resource-limited conditions. However, numerous obstacles remain 
before such genomically informed investigations are routine. In practi- 
cal terms, we encountered significant logistical issues when performing 
this work, notably the absence of reliable, continuous mains electrical 
power, forcing a dependence on unreliable electrical generators and
uninterruptible power supply (UPS) units, particularly for the bulky
PCR thermocyclers. However, portable, battery-powered thermocyclers
are in development, and isothermal approaches may be preferable for
future work¹⁸. By contrast, the MinION sequencer was unaffected by
power outages and surges. We faced consistent issues with internet 
connectivity, which is currently required for analysis. There is a press- 
ing need for a fully offline version of the analysis presented here. 
This would reduce the dependence on high bandwidth connections. 
However, it is likely that phylogenetic analysis will continue to be performed
remotely (discussed further in the supplementary Field Guide
to Portable Sequencing). In this analysis we focused on variant calling 
approaches. A de novo approach to analysis would be preferable, but
this would currently result in insertion and deletion errors due to poor
resolution of homopolymeric tracts on the MinION. Our approach
relies on amplification of genetic material before sequencing. In other
epidemics, where the causative pathogen may be unidentified, this is a
drawback because a priori knowledge of the pathogen
genome sequence is needed. In this event, sequencing directly from clinical
material may be better, although sensitivity issues persist¹⁵.

Real-time genomic surveillance is a new tool in our arsenal to assist 
difficult epidemiological investigations, and to provide an international 
and environmental context to emerging infectious diseases. This may 
improve the efficiency of resource allocation and the timeliness of 
epidemiological investigations through genomically informed inves- 
tigations of transmission chains. Real-time genomic surveillance also 
increases the possibility of identifying previously unknown chains of 
transmission. By integrating in real time our data set with that of a 
second group performing sequencing in Sierra Leone, we identified 
evidence of frequent transmissions across the border with Guinea. 
Crucially, we released data at regular intervals throughout this project 
through Github, integrating our results with those of others, displayed 
interactively at http://ebola.nextstrain.org. We employed the Virological 
web forum to discuss complex cases (http://virological.org). This sys- 
tem will continue to support the West African epidemic response and 
will serve as a template for genomic surveillance of future outbreaks. 

The Ebola epidemic was officially declared to be over on 14 January
2016 (http://www.who.int/mediacentre/news/releases/2016/ebola-zero-liberia/en/).
Hours later, a new case of EVD was confirmed in
Sierra Leone (http://who.int/mediacentre/news/statements/2016/new-ebola-case/en/),
confirming warnings that further flare-ups may
be expected. Such cases pose pressing questions about their source
that may be answered through genomic surveillance, by determining
links to previously infected individuals¹⁹ and ruling out a new zoonotic
spillover event. We now stand poised to answer such questions quickly. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 

METHODS

Ethics statement. The National Committee of Ethics in Medical Research of 
Guinea (permit no. 11/CNERS/14) approved the use of diagnostic leftover samples 
and corresponding patient data for this study. As the samples had been collected as 
part of the public health response to control the outbreak in West Africa, informed 
consent was not obtained from patients. 

Transportation. All equipment was loaded into a Pelican 1610 case (Pelican,
Torrance, USA); cold chain reagents were packed into two polystyrene boxes with
either ice or cool packs. These were sealed and placed in a holdall with the plastic 
consumables. Both pieces of luggage were flown by air as normal checked baggage. 
RNA extraction. RNA was extracted from 50 μl whole blood, 140 μl serum, 140 μl
of resuspended swab or 140 μl urine using the QIAamp Viral RNA Mini Kit
(Qiagen, Manchester, UK), following the manufacturer’s instructions. Samples
were inactivated by adding 560 μl of Buffer AVL (Qiagen) and 560 μl of 100%
ethanol while still in a glove box; this method has been shown to inactivate EBOV
in blood samples²⁰. Following inactivation, samples were handled on the bench
employing standard laboratory safety precautions.

RT-PCR. Individual 25 μl RT-PCR reactions were performed using the
SuperScript II One-Step RT-PCR System with Platinum Taq DNA Polymerase
(Life Technologies Ltd, Paisley, UK). Each reaction was made up by adding 12.5 μl
2× reaction mix, 1 μl enzyme mix, 1 μl primers (10 μM), 0.5 μl RNA extract and
nuclease-free water. Thermocycling was performed on an Eppendorf Master Cycler 
Personal instrument with the following program: 60°C for 30 min, 94°C for 2 min 
followed by 45 cycles of 94°C for 15s, 55°C for 30s, 68°C for 2 min and a final 
extension of 68 °C for 5 min. 

MinION library preparation. Each reaction was quantified on a Qubit 3.0 
fluorimeter using the dsDNA HS assay (Life Technologies). Equimolar amounts 
of each amplicon product to a total DNA mass of 1 μg were pooled into a single
tube and cleaned up using an equal volume of MAGBIO HighPrep PCR beads
(AutoQ Biosciences, Reading, UK). Pooled amplicons were diluted to 85 μl, and
end-repaired in a total volume of 100 μl, using the NEBNext End Repair Module
(New England Biolabs, Hitchin, UK) before being cleaned up using an equal volume
of HighPrep PCR beads and eluting in 25 μl nuclease-free water. 3′ dA-tailing
was performed using the NEBNext dA-Tailing Module (New England Biolabs)
in a volume of 30 μl, before being cleaned up using an equal volume of HighPrep
PCR beads and eluting in 30 μl nuclease-free water. 10 μl of ‘Adaptor mix’ and
10 μl ‘HP adaptor’ supplied in the SQK-MAP005 library preparation kit (Oxford
Nanopore Technologies, Oxford, UK) were added to the dA-tailed amplicons
along with 50 μl Blunt/TA Ligase Master Mix (New England Biolabs) in a Protein
LoBind tube (Eppendorf UK) and incubated for 10 min. The resulting sequencing
library was purified using Dynabeads His-Tag Isolation and Pulldown beads (Life
Technologies, Stevenage, UK) according to the SQK-MAP005 protocol supplied by
Oxford Nanopore Technologies as part of the MinION Access Program. The final
library was quantified using the Qubit to confirm the process had been successful.
6 μl of library was diluted using 75 μl ‘2× Running Buffer’, 66 μl nuclease-free water
(Promega UK, Chilworth, UK) and 3 μl ‘Fuel Mix’.
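
The equimolar pooling step can be worked through numerically. For double-stranded DNA the mass of one molecule is proportional to its length, so pooling equal numbers of molecules means each amplicon's share of the total mass is proportional to its length. The sketch below assumes this proportionality; the amplicon names, lengths and concentrations are hypothetical values for illustration, not measurements from this study.

```python
def equimolar_pool(amplicons, total_mass_ng=1000.0):
    """Plan an equimolar pool whose masses sum to total_mass_ng.

    amplicons maps a name to (length_bp, concentration_ng_per_ul).
    Returns {name: (mass_ng, volume_ul)}. Equal molar amounts imply that
    each amplicon's mass share is proportional to its length.
    """
    total_length = sum(length for length, _ in amplicons.values())
    plan = {}
    for name, (length, conc) in amplicons.items():
        mass = total_mass_ng * length / total_length
        plan[name] = (mass, mass / conc)
    return plan

# Hypothetical lengths (bp) and fluorimeter readings (ng/ul).
example = {"amp01": (2000, 40.0), "amp02": (1800, 25.0), "amp03": (2200, 10.0)}
plan = equimolar_pool(example)
for name, (mass, volume) in plan.items():
    print(f"{name}: {mass:.1f} ng in {volume:.1f} ul")
```

A weakly amplified product (low concentration) needs a large volume to contribute its share, which is one practical reason unbalanced amplification complicates pooling.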

MinION sequencing. A new flowcell was unpackaged and fitted onto the MinION 
device. The flowcell was primed with a blank sample created as described above, 
and left to incubate for 10 min. The priming process was repeated a second time 
before the sample was loaded. Running MinKNOW version 0.49.2.9 and starting 
the protocol ‘MAP_48Hr_Sequencing_Run.py’ initiated the sequencing run. An 
offline-capable version of MinKNOW, with internet ‘ping’ disabled and online 
updates disabled was made available to us by Oxford Nanopore Technologies 
specifically for the project (available on request from Oxford Nanopore 
Technologies). 

Data transfer. With no method of offline analysis available during the outbreak 
period, there was a dependency on local internet connectivity to facilitate the 
upload of the raw FAST5 files produced by MinKNOW. A variety of methods
were used depending on location and circumstances, with the vast majority of the
data being uploaded from the European Mobile Laboratories staff accommodation
in Coyah, Guinea, via a mobile internet 3G hotspot (TP-LINK M5350 3G hotspot
on the MTN mobile network). At times, owing to unknown factors, the upload speed
was limited to 2G and took significantly longer. Using Cygwin version 2.0.0 and the
Linux tar command, a compressed archive containing the first 5,000 to 10,000 .fast5
read files generated by each run was created. This was uploaded to a Google Drive
shared directory. Eventually in Coyah we were provided access to a broadband
connection (MTN network, 5 Mb s⁻¹, established by the World Food Program),
which proved to be more reliable than mobile internet.
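
The archiving step can be sketched as follows. This uses Python's standard `tarfile` module in place of the tar command run under Cygwin, so it is an equivalent for illustration rather than the command actually used. The function name is hypothetical, and taking the "first" reads to mean the first by file name is an assumption; ordering by modification time would be an equally reasonable reading.

```python
import tarfile
from pathlib import Path

def archive_first_reads(read_dir, archive_path, max_reads=5000):
    """Write a gzip-compressed tar archive of the first max_reads .fast5 files.

    Files are ordered by name (an assumption made for this sketch).
    Returns the number of files archived.
    """
    files = sorted(Path(read_dir).glob("*.fast5"))[:max_reads]
    with tarfile.open(archive_path, "w:gz") as archive:
        for path in files:
            # Store only the file name so the archive unpacks flat.
            archive.add(path, arcname=path.name)
    return len(files)
```

Capping the number of reads keeps the upload small, which matters on a connection that can drop to 2G speeds.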

Data handling. Data were downloaded onto a Linux server on the MRC Cloud
Infrastructure for Microbial Bioinformatics located in Birmingham, UK. Files 
were unpacked and basecalled using the Metrichor command-line interface and 
the workflow 2D Basecalling for MAP-005 (versions 1.14, 1.24 and 1.34). This 
software was provided by Oxford Nanopore Technologies (available on request)
for the project in order to permit basecalling to be carried out through the Linux 
command line as part of a pipeline. The MinION generates one direction (1D) and 
two direction reads (2D). 2D reads are higher quality and were used for analysis. 
2D reads that were in the pass filter folder and 2D reads designated as high-quality 
(due to having more complement events than template events) in the fail folder as 
determined by poretools were extracted into FASTA (for nanopolish) and FASTQ 
format (for marginAlign) with poretools version 0.5.1²¹.

Bioinformatics analysis. We use a reference mapping approach to detect 
single nucleotide variants through alignment to a reference strain from early in 
the outbreak (GenBank accession number EM_079517)!"". Due to the nature of the 
sequencing data, which is dominated by insertion and deletion errors, we do not 
attempt to call insertion or deletions". Variants were detected using the variants 
module of the nanopolish software package. Initial nucleotide base alignment was 
carried out with MarginAlign’*. Nanopolish then uses the event-level (‘squiggle’) 
data generated by the MinION to evaluate candidate variants found in the aligned 
reads as described in the following section. Variants with a log likelihood ratio of 
>200 and coverage depth of >50x (25x 2D coverage) are accepted and a con- 
sensus sequence is generated for each sample. Regions of uncertainty (for exam- 
ple in difficult to sequence homopolymeric regions or primer binding sites), or 
with low coverage (<50x, or 25x 2D coverage) are masked with an N character. 
Assuming sufficient genomic coverage is present over a specific amplified variant 
this approach gives a high true positive variant calling rate. However, failure of 
individual amplicons to amplify, or unbalanced coverage of regions may reduce 
this figure. This is assessed, on each individual sample, by artificially mutating the 
reference genome with 30 randomly chosen mutations. Mutated positions in the 
references should be detected as variants, using the simplifying assumption that 
these variants are unlikely to be present in the sample. Any positions not covered by 
the tiling amplicon scheme (that is, the extreme 5′ and 3′ ends) are not considered
in the true positive rate calculation. Each sample is therefore assigned a quality 
indicator. Those with a true positive rate (TPR, that is, sensitivity) of >75% are 
included in phylogenetic inferences. Samples with TPR <75% were not used for 
the phylogenetic analysis presented here. 
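
The per-sample quality check can be sketched as below. The idea is that if the reference is mutated at a known position, a sample carrying the original base should produce a variant call at that position whose alternative allele is the original base. The function names, the `seed` argument and the dictionary representation of variant calls are illustrative assumptions; the alignment and variant-calling steps themselves are not reproduced here.

```python
import random

BASES = "ACGT"

def mutate_reference(reference, n_mutations=30, first=0, last=None, seed=None):
    """Plant n_mutations substitutions at distinct positions in [first, last).

    Returns (mutated_sequence, truth), where truth maps each mutated position
    to the ORIGINAL base, i.e. the allele a correct variant call should report.
    first/last let the caller exclude ends not covered by amplicons.
    """
    rng = random.Random(seed)
    last = len(reference) if last is None else last
    mutated = list(reference)
    truth = {}
    for pos in rng.sample(range(first, last), n_mutations):
        truth[pos] = reference[pos]
        mutated[pos] = rng.choice([b for b in BASES if b != reference[pos]])
    return "".join(mutated), truth

def true_positive_rate(truth, called_variants):
    """Fraction of planted positions recovered; called_variants maps position -> base."""
    found = sum(1 for pos, base in truth.items() if called_variants.get(pos) == base)
    return found / len(truth)

def passes_quality(tpr, threshold=0.75):
    """Samples are kept only if their true positive rate exceeds the threshold."""
    return tpr > threshold
```

In use, variants would be called against the mutated reference, converted to a position-to-base mapping, and passed to `true_positive_rate`; a sample recovering 24 of 30 planted mutations (80%) would be kept, while one recovering 22 (73%) would not.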

Signal-based SNP calling. SNPs were called using the “variants” module from the 
nanopolish package (manuscript in preparation, https://github.com/jts/nanopolish, 
branch snp_calling_alternative_models, commit ID
25ea7bac3ab9e1d266079ac105ab2005cfa39a14).

The nanopolish variants program first finds candidate SNPs by finding mismatches
between the aligned nanopore reads and the reference genome. These candidate
SNPs are clustered into sets of nearby SNPs, an exhaustive set of candidate
haplotypes is derived from the possible combinations of SNPs, and the haplotype
that maximizes the probability of the event-level data is called as the sequence for
the region. We describe each step in detail below.

Candidate SNP generation. We iterate over the entire reference genome and 
examine positions covered by at least 20 nanopore reads. At these well-covered 
positions we considered any non-reference base that was seen in at least 20% of 
the nanopore reads to be a candidate SNP. These candidates were passed to the 
next stage of the pipeline. 
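
The candidate selection rule can be written down compactly. This is a minimal sketch, not the nanopolish implementation: it assumes the alignment has already been reduced to a simple pileup (a list of observed read bases per reference position), and the function and parameter names are illustrative.

```python
from collections import Counter

def candidate_snps(reference, pileup, min_depth=20, min_fraction=0.20):
    """Return (position, alt_base) pairs that qualify as candidate SNPs.

    pileup[i] is a list of the read bases aligned to reference position i.
    A position is examined only if at least min_depth reads cover it, and a
    non-reference base qualifies if seen in at least min_fraction of reads.
    """
    candidates = []
    for pos, ref_base in enumerate(reference):
        bases = pileup[pos]
        depth = len(bases)
        if depth < min_depth:
            continue
        for base, count in sorted(Counter(bases).items()):
            if base != ref_base and count / depth >= min_fraction:
                candidates.append((pos, base))
    return candidates

reference = "ACGT"
pileup = [
    ["A"] * 25,              # matches the reference
    ["C"] * 15 + ["T"] * 5,  # T in 25% of 20 reads: candidate
    ["A"] * 10,              # only 10 reads: skipped
    ["T"] * 27 + ["G"] * 3,  # G in 10% of reads: below threshold
]
print(candidate_snps(reference, pileup))
```

The permissive 20% threshold is suited to a first pass: it admits many false candidates from a noisy instrument, which the later likelihood-based step is expected to reject.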

Candidate haplotype generation. As the MinION sequencer does not measure 
single bases, but rather current signals dependent on a short sequence of nucleo- 
tides that are in the pore, we could not assess each SNP individually. Instead, we 
partitioned the set of candidate SNPs into groups whose signals may interact and 
overlap. We determined that SNPs separated by at least 10 bp could be treated 
independently; therefore we partitioned the candidate SNP set into subsets of SNPs 
that are within 10 bp of each other. For each subset of candidate SNPs we exhaustively
generated all possible haplotype sequences by including/excluding the individual
SNPs in the subset. As the number of possible combinations of n SNPs is 2ⁿ, we
had to discard subsets that contained more than 10 candidate SNPs or spanned a 
reference region greater than 100 bp. For each derived haplotype sequence S, we 
calculate the likelihood of S using a modified version of the hidden Markov model 
(HMM) we previously described¹⁶.
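
The partitioning and enumeration described above can be sketched as follows. This is an illustration under stated assumptions rather than the nanopolish code: candidates are (position, alt_base) pairs with at most one alternative base per position, and each haplotype is padded with `flank` reference bases on either side, a value chosen here only for the example.

```python
from itertools import product

def cluster_snps(snps, max_gap=10):
    """Group candidates so that consecutive SNPs in a group are < max_gap bp apart.

    SNPs separated by at least max_gap bp fall into different groups.
    """
    groups = []
    for snp in sorted(snps):
        if groups and snp[0] - groups[-1][-1][0] < max_gap:
            groups[-1].append(snp)
        else:
            groups.append([snp])
    return groups

def keep_group(group, max_snps=10, max_span=100):
    """Discard groups with too many SNPs or spanning too long a reference region."""
    span = group[-1][0] - group[0][0] + 1
    return len(group) <= max_snps and span <= max_span

def haplotypes(reference, group, flank=10):
    """Yield (included_snps, sequence) for all 2**n include/exclude choices."""
    start = max(0, group[0][0] - flank)
    end = min(len(reference), group[-1][0] + flank + 1)
    for mask in product([False, True], repeat=len(group)):
        seq = list(reference[start:end])
        chosen = []
        for use, (pos, alt) in zip(mask, group):
            if use:
                seq[pos - start] = alt
                chosen.append((pos, alt))
        yield chosen, "".join(seq)
```

The size limits in `keep_group` bound the work: a group of 10 SNPs already yields 1,024 haplotypes, each of which must be scored against the event data.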

Haplotype likelihoods. The nanopolish HMM calculates the probability of observ- 
ing a sequence of events emitted by the nanopore, which we denote as D, given 
an arbitrary sequence S. The structure of the HMM is as previously described but 
now allows events to be “soft-clipped” to better handle uncertainty about where 
the event-to-sequence alignment starts and ends. In addition, we incorporated a 
new model from Oxford Nanopore that models the event signals to be dependent 
on six-base-pair subsequences rather than five-base-pair subsequences. To use this 
model on SQK-MAP-005 data we calculated a global shift parameter (shift_offset) 
that rescales SQK-MAP-005 data to the 6 bp emission functions. We otherwise 
did not train the emission functions, per-read scaling parameters or transition 
probabilities of our hidden Markov model. 
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Variant calls. For each subset of candidate SNPs, the haplotype with the largest 
likelihood is called as the sequence for the region. The SNPs contained on the called 
haplotype (if any) are output in VCF format. The log likelihood ratio between the 
called haplotype and the reference haplotype (containing no SNPs) was output as 
the score for each variant to facilitate downstream filtering. Metadata such as the 
total depth of the region and the number of reads that support the called haplotype 
over the reference sequence are also output. 
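
As a sketch of this calling step, assume the HMM has already produced a log likelihood for every candidate haplotype of a subset, keyed by the tuple of SNPs it contains (the empty tuple standing for the reference haplotype). The function name and data layout are hypothetical.

```python
def call_haplotype(log_likelihoods):
    """Pick the most likely haplotype and score it against the reference.

    log_likelihoods -- dict mapping a tuple of SNPs to log P(D | S);
                       the empty tuple () is the reference haplotype.
    Returns (called_snps, log_likelihood_ratio).
    """
    reference_ll = log_likelihoods[()]
    best = max(log_likelihoods, key=log_likelihoods.get)
    return best, log_likelihoods[best] - reference_ll
```

When the reference haplotype itself has the largest likelihood, no SNPs are called and the score is zero.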

Validation experiments. Dstl amplicons. Archived Zaire ebolavirus was amplified 
using 38 primer pairs, giving approximately 500 base pair amplicons, according to 
the study protocol. As this work was before in-field sequencing, different versions 
of the MinKNOW software and Metrichor basecaller were used. Amplicons were 
sequenced by both MinION and Illumina. An Illumina library was constructed 
from the same amplicon pool and tagmented using the Nextera XT library prepa- 
ration kit. The library was sequenced on the Illumina MiSeq. Because of the huge 
excess of coverage generated, this data set was subsampled to 400,000 paired reads 
before aligning to the EM_079517 reference sequence using BWA-MEM~. After 
sorting and converting the resulting alignment to BAM using samtools, variants 
were determined using FreeBayes”*. A consensus sequence was generated using 
the vcf2fasta component of vcflib (https://github.com/ekg/vcflib). The MinION 
data was analysed as per the study methods, except for a modification to nanopolish 
to allow it to consider up to 15 variants per segment in order to account for 
the increased divergence between the genome and the reference. The MinION 
and Illumina consensus sequences were aligned using the nucmer component of 
MUMmer and variants determined using the show-snps module”. Scripts and 
documentation for this analysis are in the Github notebook Dstl validation.ipynb. 
180 genome analysis. Six samples of leftover RNA from a previously performed 
sequencing study* were processed at Public Health England Porton Down, as 
per the methods described in the manuscript. One sample did not yield any 
sequenceable products, so five genomes (EM_076534, EM_076533, EM_076383, 
EM_078416, EM_076769) were sequenced on MinION at PHE Porton Down. 
The 11 reaction scheme was used except for sample EM_076769 when the 19 reac- 
tion scheme was used. These sequences were compared with Illumina consensus 
sequences from the previously published data set in Carroll et al.°. Variants were 
identified between the reference genome (EM_079517) and each of the successfully 
sequenced samples using the show-snps component of MUMmer™. Variants detected 
by our pipeline were compared against expected variants, before and after quality 
filtering, using custom Python scripts deposited in the Github repository and 
documented in the IPython Notebook. A phylogeny was inferred using RAxML”° 
including the consensus sequences from the validation set along with all of the 
consensus sequences from Carroll et al.? MinION sequence accuracy rates for 
two-direction (2D) reads were determined using A. Quinlan’s count-errors.py 
script (http://github.com/arq5x/nanopore-scripts) as described in Quick et al.!. 
Scripts and documentation for this analysis are in the Github notebook: Examine 
validation runs.ipynb. 

Analysis of SNP calling sensitivity. Reads were subsampled at collection time 
intervals using the poretools times command”, simulating the order in which reads are 
obtained by real-time sequencing on the nanopore, to demonstrate the effect of 
coverage on SNP calling sensitivity and log likelihood ratio. 
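
The idea of subsampling by collection time can be sketched as below. This does not reproduce the poretools interface; it assumes a hypothetical list of (read_id, seconds_since_start) pairs has already been extracted, and returns the cumulative set of reads available at each time cut-off.

```python
def subsample_by_time(reads, interval):
    """Yield (cutoff_seconds, read_ids) for cumulative time windows.

    reads    -- list of (read_id, seconds_since_start) pairs (hypothetical)
    interval -- window length in seconds
    """
    ordered = sorted(reads, key=lambda r: r[1])
    if not ordered:
        return
    last = ordered[-1][1]
    cutoff = interval
    while True:
        yield cutoff, [rid for rid, t in ordered if t <= cutoff]
        if cutoff >= last:
            break
        cutoff += interval
```

Each subset could then be passed through variant calling to see how sensitivity and log likelihood ratio change as reads accumulate.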

Analysis of samples from the same patient. Samples were analysed as part of the 
real-time surveillance work. Consensus sequences were generated for four pairs of samples, 
one pair from each of four individuals. Each pair was compared individually 
using the show-snps module of MUMmer to investigate differences. 


Detection of putative transmission events from Sierra Leone. We downloaded 
the 74 genome sequences made available on http://virological.org 
(http://virological.org/t/direct-deep-sequencing-in-sierra-leone-yields-73-new-ebov-genomes-from-february-may-2015/134) 
and aligned them against sequences from our analysis 
using MUSCLE”*. We then generated a phylogenetic tree using FastTree 2 with the 
GTR model’’. Any sequences that fell into the GN1 or SL3 lineages were included 
in further analysis. 

Phylogenetic inferences. Consensus sequences from real-time sequencing were 
aligned with previously published genome sequences from Guinea, Sierra Leone 
and Liberia’. To address the over-representation of Sierra Leone sequences in 
this set we randomly down-sampled available sequences, resulting in a total of 
313 sequences from Sierra Leone. Maximum likelihood trees were produced using 
RAxML 8.2.3 with the GTRGAMMA model and 200 bootstrap replicates”. 
Time-scaled trees were produced with BEAST v1.8.27* using an HKY+gamma substitution 
model”? partitioned by first, second and third codon positions and intergenic 
regions, a Skygrid tree prior*! and an uncorrelated lognormal clock”, and an 
uninformative prior on the mean of the molecular clock rate (XML file in the 
accompanying Github repository). The maximum clade credibility tree was 
recovered using TreeAnnotator. Phylogenetic trees were annotated using the ete3 
Python package. 

Code availability. Reproducible workflows for the analysis presented here and 
consensus sequences can be found at http://github.com/nickloman/ebov and 
are freely available under the MIT license. The repository contains the complete 
set of bioinformatics scripts, with associated IPython Notebooks to 
regenerate the figures and tables presented in this manuscript. 
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Extended Data Figure 1 | Primer schemes employed during the study. 
We designed PCR primers to generate amplicons that would span the 
EBOV genome. a, We initially designed 38 primer pairs which were 
used in the initial validation study and which cover >97% of the EBOV 
genome. During in-field sequencing we used a 19-reaction scheme or 
11-reaction scheme, which generated longer products. The predicted 
amplicon products are shown with forward primers and reverse primers 
indicated by green bars on the forward and reverse strand, respectively, 
scaled according to the EBOV virus coordinates. b, c, The amplicon 
product sizes expected are shown for the 19-reaction scheme (b) and the 
11-reaction scheme (c). No amplicon covers the extreme 3’ region of the 
genome. The last primer pair, 38_R, ends at position 18578, 381 bases 
away from the end of the virus genome. The primer diagram was created 
with Biopython*’. 
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A. Equipment 


Item | Number | Model
Thermocycler | 1-3 | MasterCycler Personal (Eppendorf)
Fluorometer | 1 | Qubit 3.0 (Life Technologies)
Laptop | 2-3 | NT310-H (Stone)
MinION | 2-3 | -
Pipettes | 6 | P2, 10, 20, 100, 200, 1000 (Gilson)
Microfuge | 1-2 |
Dry bath | 1 | Mini Dry Bath Incubator (Starlab)
Magnetic rack | 1 | MagnaRack (Life Technologies)
Power strip | 1 | Dependent on country


B. Consumables

Item | Supplier
DNA LoBind Tubes (2 ml) | Eppendorf
Protein LoBind Tubes (2 ml) | Eppendorf
Qubit Assay Tubes | Life Technologies
PCR Tubes with Flat Caps (0.2 ml) | Starlab
Pipette Tips (10 µl, 20 µl, 100 µl, 200 µl, 1000 µl) | Sarstedt
Nitrile Gloves | Kimberly Clark Professional


C. Reagents

Reagent | Shipping condition | Supplier
Nuclease-Free Water | Ambient | Qiagen
Ethanol 100% | Ambient | -
HighPrep PCR | Chilled | MAGBIO
Dynabeads His-Tag Isolation and Pulldown | Chilled | Life Technologies
Oligos | Chilled | Sigma
Qubit dsDNA HS Assay Kit | Chilled | Life Technologies
MinION Flowcells | Chilled | Oxford Nanopore Technologies
NEBNext End-Repair Module | Frozen | New England Biolabs
NEBNext dA-Tailing Module | Frozen | New England Biolabs
Blunt/TA Ligase Master Mix | Frozen | New England Biolabs
SuperScript III One-Step RT-PCR System with Platinum Taq DNA Polymerase | Frozen | Life Technologies
SQK-MAP005 | Frozen | Oxford Nanopore Technologies


Extended Data Figure 2 | List of equipment and consumables to 
establish the genome surveillance system. a–c, We show the list of 
equipment (a), disposable consumables (b) and reagents (c) to establish 
in-field genomic surveillance. Sufficient reagents were shipped for 
20 samples. MinION sequencing requires a mix of chilled and frozen 
reagents. Recommended shipping conditions are specified. The picture 
underneath depicts MinION flowcells ready for shipping with insulating 
material (left) and frozen reagents (right). 
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Extended Data Figure 3 | Bioinformatics workflow. This figure summarizes the steps performed during bioinformatics analysis (ordered from top to 
bottom), in order to generate consensus sequences. The right column shows the example software command executed at each step. 
Sample | Reaction scheme | Total mutations | True positives | False positives | False negatives, masked: outside amplicon scheme | False negatives, masked: primer binding site | False negatives, wrong: erroneous reference call
Pre-quality filtering (all variants)
076533 | 11 reactions | 20 | 19 | 1 | 1 | - | -
076383 | 11 reactions | 18 | 16 | 1 | 1 | 1 | -
078416 | 11 reactions | 18 | 17 | 1 | - | 1 | -
076769 | 19 reactions | 19 | 19 | 0 | - | - | -
Post-quality filtering (log likelihood ratio > 200)
076533 | 11 reactions | 20 | 19 | 0 | 1 | - | -
076383 | 11 reactions | 18 | 16 | 0 | 1 | 1 | -
078416 | 11 reactions | 18 | 17 | 0 | - | 1 | -
076769 | 19 reactions | 19 | 19 | 0 | - | - | -
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Extended Data Figure 4 | Results of MinION validation. a, The results 
of comparing four MinION sequences with Illumina sequences generated 
as part of a previous study? are shown. Each row in the table demonstrates 
the number of true positives, false positives and false negatives for a 
sample. False negatives may result in masked sequences, owing to being 
outside of regions covered by the amplicon scheme, having low coverage 
or falling within a primer binding site. Results before and after quality 
filtering (log likelihood ratio of >200) are shown. After quality filtering, 
no false positive calls were detected. All detected false negatives were 
masked with Ns in the final consensus sequence. No positions were 
called incorrectly. b, The four consensus sequences, plus an additional 
sample that had missing coverage in one amplicon are shown as part of 
a phylogenetic reconstruction with genomes from Carroll et al.*. Sample 
labels in red, blue, pink, yellow and blue represent pairs of sequences 
generated on MinION and Illumina. These fall into identical clusters. 
Extended Data Figure 5 | Relationship between coverage and log-likelihood 
ratio for sample 076769. Line-plot showing the relationship 
between sequence depth of coverage (x axis) and the log likelihood ratio 
for detected SNPs derived by subsampling reads from a single sequencing 
run to simulate the effect of low coverage. The horizontal and vertical lines 
indicate the cut-offs (quality and coverage, respectively) for consensus 
calling. Therefore, all variants are detected below 25× coverage, and the 
vast majority meet the threshold quality at 25× coverage or slightly above. 
Any combination of log likelihood ratio or coverage that placed variants 
in the grey box would be represented as a masked position in the final 
consensus sequence. 
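
The masking rule implied by these cut-offs can be sketched as follows. The thresholds (log likelihood ratio > 200, 25× coverage) come from the captions; whether the coverage threshold is inclusive, and the function and argument names, are assumptions made for illustration.

```python
MIN_LLR = 200.0    # quality cut-off: log likelihood ratio must exceed this
MIN_COVERAGE = 25  # coverage cut-off, assumed inclusive here


def consensus_base(ref_base, alt_base, llr, coverage):
    """Return the base written to the consensus at one position.

    alt_base is None when no variant was called at this position.
    Positions failing either cut-off are masked with 'N'.
    """
    if coverage < MIN_COVERAGE:
        return "N"
    if alt_base is None:
        return ref_base
    return alt_base if llr > MIN_LLR else "N"
```

A variant with adequate coverage but a log likelihood ratio of 150 would therefore appear as N rather than as either the reference or the alternative base.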
Extended Data Figure 6 | Duration of MinION sequencing runs. For each sequencing run, the duration is measured as the difference between the 
timestamps of the first read seen and the last read transferred for analysis. 127 runs are shown, with 15 outliers with duration greater than 200 min excluded. 
Extended Data Figure 7 | Histogram of Ct values for study samples. Ct values for samples in the study (where information was available) ranged 
between 13.8 and 35.7, with a mean of 22. 
Extended Data Figure 8 | Sequence accuracy for samples. a, b, Accuracy 
measurements for the entire set of two-direction reads were made for the 
validation samples, sequenced in the United Kingdom (a) and each of the 
142 samples from real-time genomic surveillance (b). Accuracy is defined 
according to the definition from Quick et al.''. Vertical dashed lines 
indicate the mean accuracy for the sample. 
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reflects country of origin. a—c, the entire data set is shown (a), with 
Extended Data Figure 10 | Root-to-tip divergence plot and mean 
evolutionary rate estimate. a, Root-to-tip divergence plot for the 728 
Ebola samples generated through maximum likelihood analysis. Samples 
from real-time genomic surveillance are coloured as per Fig. 3 and 
Extended Data Fig. 9. b, Mean evolutionary rate estimate (in substitutions 
per site per year) across the EBOV phylogeny recovered using BEAST 
under a relaxed lognormal molecular clock. Blue area corresponds to 
the 95% highest posterior density (HPD) (mean of the distribution is 
1.19 × 10^-3, 95% HPDs: 1.09-1.29 × 10^-3 substitutions per site per year). 
Hatched regions in red are outside the 95% HPD intervals. 
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Structure- and function-based design of 
Plasmodium-selective proteasome inhibitors 


Hao Li!+, Anthony J. O’Donoghue*}, Wouter A. van der Linden!, Stanley C. Xie*, Euna Yoo?, Ian T. Foe, Leann Tilley’, 


Charles S. Craik*, Paula C. A. da Fonseca® & Matthew Bogyo? 


The proteasome is a multi-component protease complex 
responsible for regulating key processes such as the cell cycle and 
antigen presentation!. Compounds that target the proteasome are 
potentially valuable tools for the treatment of pathogens that depend 
on proteasome function for survival and replication. In particular, 
proteasome inhibitors have been shown to be toxic for the malaria 
parasite Plasmodium falciparum at all stages of its life cycle~>. Most 
compounds that have been tested against the parasite also inhibit 
the mammalian proteasome, resulting in toxicity that precludes 
their use as therapeutic agents”*. Therefore, better definition of the 
substrate specificity and structural properties of the Plasmodium 
proteasome could enable the development of compounds with 
sufficient selectivity to allow their use as anti-malarial agents. To 
accomplish this goal, here we use a substrate profiling method 
to uncover differences in the specificities of the human and 
P. falciparum proteasome. We design inhibitors based on amino-acid 
preferences specific to the parasite proteasome, and find that they 
preferentially inhibit the β2-subunit. We determine the structure of 
the P. falciparum 20S proteasome bound to the inhibitor using cryo-electron 
microscopy and single-particle analysis, to a resolution of 
3.6 Å. These data reveal the unusually open P. falciparum β2 active 
site and provide valuable information about active-site architecture 
that can be used to further refine inhibitor design. Furthermore, 
consistent with the recent finding that the proteasome is important 
for stress pathways associated with resistance of artemisinin family 
anti-malarials”*, we observe growth inhibition synergism with low 
doses of this β2-selective inhibitor in artemisinin-sensitive and 
-resistant parasites. Finally, we demonstrate that a parasite-selective 
inhibitor could be used to attenuate parasite growth in vivo without 
appreciable toxicity to the host. Thus, the Plasmodium proteasome 
is a chemically tractable target that could be exploited by next- 
generation anti-malarial agents. 

The Plasmodium proteasome represents a viable target for anti- 
malarial drugs owing to its essential nature in protein turnover and 
the parasite’s need to rapidly divide inside host cells®. We have previ- 
ously shown that differences exist in the reliance of human cells and 
P. falciparum on specific proteasome activities for survival'®. Therefore, 
we reasoned that a structural and biochemical understanding of this 
enzyme complex could yield compounds with desirable safety profiles 
due to their selectivity for the parasite proteasome. We purified the 
P. falciparum 20S proteasome"", activated it with human PA28α!*!3 
(Extended Data Fig. 1a), and determined the substrate preferences of 
this activated proteasome by monitoring the degradation pattern of 
228 diverse synthetic tetradecapeptides using liquid chromatography- 
tandem mass spectrometry ”!° . A frequency profile was generated 
using iceLogo software’®, indicating which amino acids were most and 


least favoured in the subsites surrounding each cleaved bond. In parallel, 
we generated a substrate specificity profile for the human 20S/PA28α 
proteasome under identical conditions. We detected 284 and 328 cleavage 
sites for the activated P. falciparum and human 20S proteasomes, 
respectively, with 171 sites being cleaved by both (Fig. 1a). However, 
113 and 157 sites were uniquely cleaved by P. falciparum and human 
proteasomes, respectively (Fig. 1a-c and Extended Data Fig. 1b, c), with 
the major differences occurring on the amino (N)-terminal side of the 
cleavage site, namely at the P1 and P3 sites. For the P1 position, which 
is directly adjacent to the cleavage site, the human proteasome showed 
chymotrypsin-like (Leu/Phe/Tyr), trypsin-like (Arg), and caspase-like 
(Asp) activity (Fig. 1b), consistent with previous studies!”, while the 
P. falciparum proteasome showed strong preference for aromatic 
residues (Fig. 1c). Direct comparison of the specificity profiles of both 
proteasomes revealed a clear preference for Trp at P1 and P3 in the 
parasite proteasome (Fig. 1d and Extended Data Fig. 1b, c). 

Using the canonical tri-leucine scaffold found in common 
proteasome inhibitors such as MG132 and Z-L3-VS!*), we systematically 
replaced the Leu residues at the P1 and P3 positions with Trp, 
resulting in the compounds LLW-vs, WLL-vs and WLW-vs (Fig. 1e 
and Extended Data Fig. 2a). We treated purified P. falciparum 20S 
proteasomes with each compound and measured residual activity of 
the catalytic sites using an activity-based probe"® (Fig. 1f). Changing 
the P1 position to Trp (LLW-vs) markedly reduced inhibition of the 
β5 active site, without altering the activity towards the β2-subunit, 
while Trp in the P3 position (WLL-vs) resulted in potent inhibition 
of both β2 and β5 activities. Finally, Trp substitution in both P1 and 
P3 positions (WLW-vs) resulted in a potent inhibitor that was highly 
selective for the parasite proteasome β2-subunit. With the exception 
of LLW-vs, these compounds inhibited the human β5-subunit while 
all compounds only weakly inhibited the human β1 and β2 activities 
(Fig. 1g and Extended Data Figs 1d and 2b). Therefore, these inhibitors 
primarily differ in their selectivity towards the β2-subunits and this 
selectivity was retained in human and P. falciparum inhibitor treatments 
in live culture (Extended Data Fig. 2c). 

We investigated the structural basis for the β2 selectivity of the 
P. falciparum 20S proteasome towards WLW-vs by high-resolution 
cryo-electron microscopy (cryo-EM) and single-particle analysis, using 
a strategy similar to that which we previously used to determine the 
structure of an inhibitor-bound human 20S proteasome””. The resolution 
of the final map was estimated at 3.6 Å (Fig. 2 and Extended Data 
Fig. 3a-e), with local resolution decreasing for solvent-exposed surfaces 
(Fig. 2a, b). In our map, sheet-forming β-strands are well resolved 
(Fig. 2d) and side chains are well defined for both β-strands and 
α-helices (Fig. 2e, f), particularly when stabilized by intra-protein 
contacts or steric restraints. We were able to unambiguously assign a 
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Figure 1 | Substrate profile of the activated human and P. falciparum 
20S proteasome guides inhibitor design. a, Total number of cleavage sites 
detected after incubation for 4 h of the activated human and P. falciparum 
proteasome with the peptide library. b, c, The iceLogos generated from 
the cleavages are shown for human (b) and P. falciparum (c) proteasome. 
Amino acids that are most and least favoured at each position are shown 
above and below the axis, respectively. Lower-case ‘n’, norleucine; amino 
acids in black text are statistically significant (P < 0.05, unpaired two-tailed 
Student's t-test). d, The Z-score for amino acids at each position 
(P4–P4’) was calculated for both the human and the parasite proteasome 
on the basis of the cleavages in a, and the difference between the Z-scores 


clear density extending from the β2-subunit N-terminal threonine to 
the WLW-vs inhibitor, with the vinyl sulfone group and the WLW side 
chains clearly resolved in a β-strand secondary structural conformation 
(Fig. 2g). 

When the protein coordinates for the P. falciparum β1- and β5-subunits 
are aligned to those of the ligand-bound β2-subunit, the 
resulting superimposition of the WLW-vs ligand into the β1 and β5 
active sites clearly reveals space constraints for its binding. While the 
more open conformation of the β2 binding pocket easily accommodates 
the WLW-vs side chains (Fig. 3a), the ligand overlay into the 
β1 and β5 active sites results in the burying of its Trp side chains into 
the protein van der Waals surface (Fig. 3b, c), with the exception 
of the P3 position in the β5 active site (Fig. 3c). We extended this 
comparison to the protein coordinates of the human 20S proteasome 
active sites, using the coordinates of the human apo complex”! 
(Protein Data Bank (PDB) accession 4R30). Superimposing the 
WLW-vs inhibitor onto the human apo proteasome core (Fig. 3d-f) 
shows its active sites have no accessibility for the P1 and P3 Trp 
side chains of the inhibitor, with the exception of the P3 position at 
the β5 active site (Fig. 3f). Similar observations can be made when 
using the protein model of the human proteasome core bound to the 
inhibitor AdaAhx3-LLL-vs (PDB 5A0Q) (Extended Data Fig. 3f-h). 
This structural comparison of the accessibility of P. falciparum and 
human proteasome active sites is consistent with, and explains, the observed 
specificity of the WLW-vs inhibitor towards the P. falciparum β2-subunit, 
which would be difficult to predict without direct high-resolution 
structural information. The results presented here show 
an important and novel application of high-resolution cryo-EM to 
interpret the basis for the specific binding of a ligand to an active site, 
where the information obtained can be harnessed to guide further 
ligand development for therapeutic usage. 

P. falciparum parasites can become resistant to killing by the front-line 
artemisinin (ART) drug family??-*4; however, these ART-treated 
parasites are particularly sensitive to disruption of normal proteasome 
function”®. We previously showed that inhibition of the proteasome 
β5-subunit is required for attenuation of P. falciparum growth”. 
However, we reasoned that a sublethal dose of the Plasmodium 
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is shown as a heatmap. e, Inhibitors are designed by substituting Trp at 
either the P1 and/or P3 positions in the morpholino-capped tri-leucine 
vinyl sulfone. Residue 1 (R1) and 2 (R2) refer to the amino acids Leu or 
Trp. f, Inhibition of purified P. falciparum 20S as assessed by activity- 
based probe labelling. The same experiment was repeated for the human 
20S proteasome (Extended Data Fig. 2b). g, Activity of each subunit in 
human or P. falciparum proteasome after 10 µM inhibitor treatment was 
determined by image quantification of the intensity of probe labelling 
and normalized to mock-treated control. Error bars, s.d. n = 3 purified 
proteasome from three independent experiments (for gel source data, see 
Supplementary Fig. 1a, b). 


β2-selective inhibitor WLW-vs may still be sufficient to enhance killing 
by ART in both sensitive and resistant parasites. When laboratory-adapted 
clinical isolates were treated at the early ring stage of the 
intraerythrocytic cycle, ART-resistant parasites (PL7) showed close to 
twofold higher sensitivity to WLW-vs than ART-sensitive parasites (PL2) 
(Fig. 4a). In addition, we observed synergism between dihydroartemisinin 
(DHA) and WLW-vs at concentrations where WLW-vs selectively 
inhibits the β2-subunit of the Plasmodium proteasome (Fig. 4b and 


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Figure 2 | Structure of the P. falciparum 20S proteasome core bound 
to the inhibitor WLW-vs, determined by cryo-EM and single-particle 
analysis. a, The cryo-EM map is surface rendered and colour coded 
according to local resolution as determined by ResMap. b, Sections of the 
map as shown in a, revealing the higher resolution at the internal regions 
of the complex, compared with solvent exposed surfaces. c, Section of the 
cryo-EM map, as mesh representation, showing the overall fitting of the 
protein model. d-g, Detailed views of the map showing clear separation 
of sheet-forming β-strands (d), good recovery of side chains, both in 
β-strands (e) and α-helices (f), and the densities for the WLW-vs inhibitor 
at the proteasome β2 active site (g). 
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Figure 3 | Structural comparison of the P. falciparum and human 
proteasome 20S core active sites. a, Coordinates of the P. falciparum 
proteasome β2 active site bound to WLW-vs. The inhibitor P1, P2 and P3 
positions are indicated. b, c, Coordinates of the P. falciparum proteasome 
β1 (b) and β5 (c) active sites with superimposed coordinates of the 
WLW-vs inhibitor, as shown in a. d-f, Coordinates of the apo human 
proteasome 20S core (PDB 4R30) β2 (d), β1 (e), and β5 (f) active sites, 
with superimposed coordinates of the WLW-vs inhibitor, as shown in a. 
In all parts the protein is represented as van der Waals surfaces and the 
inhibitor as sticks. 


Extended Data Fig. 4a). As parasites are normally insensitive to a pulse 
inhibition of the proteasome β2 active site alone’®, this emphasizes the 
heightened sensitivity of P. falciparum to proteasome inhibition in the 
presence of DHA. 

We have previously shown that co-inhibition of the Plasmodium 
proteasome β2- and β5-subunits results in complete parasite growth 
attenuation at all stages of its blood cycle, even after short inhibitor 
treatment times!°. WLL-vs is a potent inhibitor of the parasite β2 
and β5 active sites, while only effective against the human β5 activity 
(Figs 1g, 3 and Extended Data Fig. 1d, e). WLL-vs is therefore likely 
to have potent anti-malarial activity with minimal toxicity to the 
host, since a short inhibition of host β5 activity is well tolerated in 
non-transformed mammalian cells”*. We treated live P. falciparum cultures 
and primary human fibroblasts for 1 h with WLL-vs, and were 
surprised to find that WLL-vs inhibited all three proteasome catalytic 
sites in P. falciparum, while having little effect on any of the human subunits 
until high micromolar concentrations (Fig. 4c). This increased 
potency is probably due to inhibitor accumulation in the intact 
parasites and enhanced inhibition of the parasite 26S proteasome 
(20S with regulatory subunit 19S proteasome complex)’. This translated 
into a large therapeutic window for the WLL-vs inhibitor in 1 h 
and 72 h viability assays (Fig. 4d and Extended Data Fig. 1e). Although 
the P. falciparum proteasome inhibition corresponds well to the 
decrease in parasite survival (Fig. 4d), we sought to determine whether 
off-target toxicity was occurring. First, a diastereomer of WLL-vs 
containing a P1 D-Leu (WL(d)L-vs) showed a dramatic reduction in 
compound potency (Extended Data Fig. 5a-c), indicating that these 
compounds were not indiscriminately cytotoxic. Second, vinyl-sulfones 
can react with cysteine proteases”®; therefore, we assessed reactivity of 
WLL-vs towards P. falciparum cysteine proteases using the activity-based 
probe BODIPY-TMR-DCG04 (ref. 29). We observed inhibition 
of cysteine proteases with WLL-vs only at concentrations that were 
Figure 4 | Exploiting differences in the β2-subunits of the two 
proteasome species for selective parasite killing. a, b, β2-subunit-selective 
inhibitor WLW-vs can synergize with DHA treatment in ART-resistant 
P. falciparum. a, Viability 72 h after treatment for 1 h of ART-sensitive 
(PL2) and -resistant (PL7) parasites at early ring stage (2 h after 
invasion) with WLW-vs (abbreviated as WLW). b, Top, dose-response to 
DHA in the absence and presence of sublethal doses of WLW-vs. Bottom, 
isobologram of the two inhibitors at the 50% lethal dose. PL7 parasites 
were treated for 3 h at early ring stage (4h after invasion). a, b, Data shown 
are individual data points (two technical replicates for a), representative of 
three independent experiments. c-e, WLL-vs co-inhibits β2 along with 
the other two catalytic subunits, resulting in potent parasite killing. 

c, Human foreskin fibroblasts (HFFs) or P. falciparum schizonts were 
treated with WLL-vs for 1 h, and activity-based probe labelling was 
performed post-lysis (for gel source data, see Supplementary Fig. 1c, d). 
Fluorescent gel scan (top) and silver stain (bottom) are shown. d, Non- 
confluent, replicating HFFs (n = 9 cell cultures, from three independent 
experiments of triplicates) or P. falciparum at ring stage (n = 6 parasite 
cultures, from two independent experiments of triplicates) were pulse-treated 
for 1 h with WLL-vs and viability was determined after 71 h. 
Activities of the P. falciparum proteasome subunits after inhibitor 
treatment were determined as described in Methods (n = 3 parasite lysates 
from three independent experiments). Error bars, s.d. e, Balb/c female 
mice were infected with P. chabaudi and treated with a single intravenous 
dose of vehicle (n = 6 mice) or 35 mg kg^-1 WLL-vs (n = 6 mice) on day 3 
after infection (arrow). Parasitaemia and weight of the mice were 
monitored daily until the infection was resolved naturally by the host 
(day 7 onwards). Error bars, s.d. 


>tenfold over the 1 h treated half-maximum effective concentration 
(EC50) (Extended Data Fig. 5d). Third, we assessed parasite morphology 
24 h after a 1 h WLL-vs pulse treatment and observed arrested 
parasite development and eventual parasite death at high inhibitor 
concentrations (Extended Data Fig. 5e). The absence of a food vacuole 
defect indicated that the falcipains in this digestive organelle were not 
inhibited. Taken together, these data confirm that WLL-vs exerts its 
anti-malarial effects through direct proteasome inhibition. 
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Given the large therapeutic window of WLL-vs, we assessed its effi- 
cacy in a rodent Plasmodium chabaudi infection model. We infected 
Balb/c female mice with 1 x 10° parasites and treated them 3 days later 
using a single bolus dose of 35 mg kg^-1 WLL-vs or vehicle, by tail vein 
injection. We observed almost complete reduction of parasite bur- 
den after WLL-vs treatment (Fig. 4e) and good tolerance to the drug 
as assessed by appearance, activity, and weight retention (Extended 
Data Fig. 4b). The efficacy of this treatment was further confirmed 
in an intraperitoneal single bolus dosing study of mice infected with 
P. chabaudi (Extended Data Fig. 4c). 

In summary, we have shown through biochemical and structural 
studies that the Plasmodium proteasome is sufficiently distinct from
the human proteasome for selective targeting. Furthermore, we show
that even sublethal doses of proteasome inhibitor can sensitize
parasites to ART. Our lead molecule, WLL-vs, is a mechanism-based
peptidic inhibitor that is chemically similar to the US Food
and Drug Administration-approved drug Kyprolis. We recognize that 
the unique disease epidemiology of malaria calls for stringent criteria 
in drug safety, stability, and delivery, and we are currently optimiz- 
ing WLL-vs to generate a class of compounds with improved potency 
and oral bioavailability. Previous success in optimizing peptide-based 
agents for efficient oral delivery offers promise in this class of
inhibitors as next-generation anti-malarials. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Materials for substrate profiling. Proteasome substrates Suc-LLVY-amc and
human 20S proteasome were purchased from Boston Biochem. The P. falciparum
20S proteasome was purified as previously reported. The human PA28α was a
gift from Boston Biochem. The peptide library for substrate cleavage profiling was 
used as described in ref. 15. 

Fluorogenic substrate assay. Substrate assays were performed in black 96-well 
Costar plates. For these assays, purified human or P. falciparum 20S proteasomes
were activated using human PA28α. Each enzyme was added to a final concentration
of 1 nM and the reaction proceeded in the presence of 12.5 μM Suc-LLVY-amc
for 1 h at room temperature (21 °C). The slope over the linear range of the reaction
was determined for each of the enzymes. All experiments were performed at least 
in triplicate. 
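
The slope determination described above can be illustrated with a minimal Python sketch. It assumes that the readings passed in have already been restricted to the linear range of the reaction; the function name `linear_slope` and the example readings are hypothetical and are not taken from the study.

```python
def linear_slope(times, signal):
    """Ordinary least-squares slope of signal versus time."""
    n = len(times)
    if n < 2 or n != len(signal):
        raise ValueError("need at least two paired points")
    mean_t = sum(times) / n
    mean_s = sum(signal) / n
    # Sum of squares in time and the time-signal cross product.
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (s - mean_s) for t, s in zip(times, signal))
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    return sxy / sxx


# Hypothetical readings: time in minutes, relative fluorescence units (RFU).
minutes = [0, 10, 20, 30, 40]
rfu = [100.0, 150.0, 200.0, 250.0, 300.0]
rate = linear_slope(minutes, rfu)  # RFU per minute
```

Comparing such slopes between enzymes is only meaningful if each was measured at the same enzyme and substrate concentration, as in the assay described.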

Multiplex substrate profiling of human and Plasmodium proteasomes. The 
peptide library consisting of 228 tetradecapeptides was designed according to an 
algorithm previously reported. Peptides were split into two pools of 114 for optimal
liquid chromatography separation. P. falciparum and human 20S proteasomes were 
assayed at 2 nM, in the presence of 24 nM of PA28α and 500 nM of each peptide. All
assays were performed at room temperature in 20 mM HEPES, pH 7.5, and 0.5 mM 
EDTA. Aliquots were removed and acid quenched to pH 3.0 or less with formic 
acid (4% final) after incubation for 15, 60, and 240 min. A control sample lacking 
enzyme was also prepared under identical conditions to account for non-enzymatic 
degradation of the substrates. Each sample was de-salted with C18 tips (Rainin), 
and ~130 ng of total peptide was injected for mass spectrometry analysis. Peptide
separation was performed using an EZ-Spray C18 column (Thermo, ES800,
PepMap, 3 μm bead size, 75 μm × 15 cm). Reverse-phase chromatography was
performed at a flow rate of 600 nl min⁻¹ for loading and 300 nl min⁻¹ for separation, using
a nanoACQUITY (Waters) ultra-performance liquid chromatography instrument.
A 60 min linear gradient from 2% to 30% acetonitrile in 0.1% formic acid was used,
and mass spectrometry was performed on a LTQ Orbitrap XL. Survey scans were 
recorded over a 350-1500 m/z range, and peptide fragmentation was induced with 
collision-induced dissociation for tandem mass spectrometry. Collision-induced 
dissociation was performed on the six most intense precursor ions, with a min- 
imum of 1,000 counts, using an isolation width of 2.0 atomic mass units, with 
35% normalized collision energy. Data were analysed as previously reported using 
the in-house PAVA software to generate centroid peak lists, Protein Prospector 
software version 5.10.15 for data searches, and iceLogo software for visualization of 
amino-acid frequency surrounding the cleavage sites. The Z-score was calculated
using the standard formula and indicates deviation from the mean frequency of a 
specific amino acid on a specific position in a reference set of sequences. 
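
As an illustration of the standard formula, z = (x − μ)/σ, the following Python sketch computes a Z-score for one amino acid at one position. It is a sketch under stated assumptions: the reference frequencies are supplied as a list of values (for example, from sampled reference subsets), σ is taken as their sample standard deviation, and the function name `z_score` and the input values are hypothetical. How iceLogo itself estimates σ is not described here.

```python
import statistics


def z_score(observed_freq, reference_freqs):
    """Deviation of an observed frequency from the reference mean, in units of s.d."""
    mu = statistics.mean(reference_freqs)
    sigma = statistics.stdev(reference_freqs)  # sample standard deviation (assumption)
    if sigma == 0:
        raise ValueError("reference frequencies show no variation")
    return (observed_freq - mu) / sigma


# Hypothetical frequencies of one residue at one position.
reference = [1.0, 2.0, 3.0]
z = z_score(3.0, reference)
```

A positive value indicates enrichment relative to the reference set, and a negative value indicates depletion.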

P. falciparum replication assays. P. falciparum D10 cultures were maintained, 
synchronized, and lysed as previously described. P. falciparum D10 was obtained
originally from the Malaria Research and Reference Reagent Resource Center 
(MR4). Parasite cultures were grown in human erythrocytes (no blood type 
discrimination) purchased from the Stanford Blood Center (approved by Stanford 
University). Ring stage P. falciparum culture at 2% parasitaemia and 0.5% 
hematocrit was added to 96-well plates spotted with compounds. The P. falciparum 
culture was incubated with each compound for 1 h or 72 h. After compound treatment
for 1 h, the inhibitor was removed and the culture was washed three times in
media, before fresh media was added for a further 71 h of growth. After incubation,
the culture was fixed in a final concentration of 1% paraformaldehyde (in PBS) 
for 30 min at room temperature. The nuclei stain YOYO-1 was then added to a 
final concentration of 50 nM and incubated overnight at 4°C. Parasite replication 
was monitored by observation of a YOYO-1-positive population (infected) and 
YOYO-1-negative population (uninfected) using a BD Accuri C6 automated flow
cytometer. Values in Extended Data Fig. 1e are the average ± s.d.; n = 6 parasite
cultures from two independent experiments of triplicates.
Cell viability assay in mammalian cells. HFFs used in all experiments were 
non-transformed primary cells (obtained from J. Boothroyd) derived from 
human foreskin tissues and tested for mycoplasma contamination. HFFs were 
plated at 2,500 cells (non-confluent) per well <24 h before addition of the compound.
Compounds were diluted for dose-response concentrations in media, and
added to the cells for 72 h. Cell viability was measured using the CellTiter-Blue
Assay (Promega) as per the manufacturer’s instructions. Values in Extended Data
Fig. 1e are the average ± s.d.; n = 9 cell cultures from three independent experiments
of triplicates for HFF treatment, except for 1 h WLW-vs, 1 h LLW-vs, and 72 h
LLL-vs, where n = 6 cell cultures from two independent experiments of triplicates.
Activity-based probe labelling of mammalian and P. falciparum proteasome.
BMV037 was used at a final concentration of 10 μM to label P. falciparum proteasome
activity, and MV151 was used at a final concentration of 2 μM to label
mammalian proteasome activity. Purified proteasomes (human or P. falciparum,
10 nM) were incubated with each inhibitor for 1 h at 37 °C before addition of
BMV037 or MV151 for a further 2 h at 37 °C. Samples were denatured by addition
of SDS sample buffer, boiled briefly and run on a 12% SDS-PAGE gel. Gels were
scanned on the Cy5 channel (for BMV037) or the Cy3 channel (for MV151) on a
Typhoon Scanner (GE Healthcare). Quantification of the intensity of the labelled
proteins was done using ImageJ. A small amount of activation in the β1-subunit
was observed in the human proteasome after inhibitor treatment.

Assessment of P. falciparum proteasome activity in intact inhibitor-treated
parasites. P. falciparum was cultured to around 10% parasitaemia to ensure that sufficient
parasites were available for proteasome labelling. P. falciparum culture
was first treated for the indicated amount of time, and spun down at 3,200 r.p.m.
(962g) to remove inhibitor. Each culture was then washed once in fresh media and
P. falciparum lysate obtained as previously described. Proteasome labelling
of the parasite lysate was performed as described above. To assess proteasome 
inhibition, the intensity of the proteasome labelling was quantified by Image J and 
the amount of proteasome inhibition was taken as a percentage of the dimethylsul- 
foxide (DMSO)-treated control. 
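
The normalization to the DMSO-treated control can be sketched in Python as follows. This is an illustration only: it assumes that residual labelling is expressed as a percentage of the control band intensity and that inhibition is its complement; the function names and the intensity values are hypothetical.

```python
def percent_of_control(band_intensity, dmso_intensity):
    """Residual probe labelling as a percentage of the DMSO-treated control."""
    if dmso_intensity <= 0:
        raise ValueError("control intensity must be positive")
    return 100.0 * band_intensity / dmso_intensity


def percent_inhibition(band_intensity, dmso_intensity):
    """Inhibition taken as the complement of residual labelling (assumption)."""
    return 100.0 - percent_of_control(band_intensity, dmso_intensity)


# Hypothetical band intensities (arbitrary units) from a quantified gel.
inhibition = percent_inhibition(250.0, 1000.0)
```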

Assessment of WLL-vs activity and DHA synergism against ART-sensitive 
and -resistant P. falciparum. Sensitivity of PL2 and PL7 P. falciparum strains (at
2 h after invasion) to a 1-h pulsed treatment with WLW-vs was determined as
described previously. Interactions between DHA and WLW-vs against the K13
mutant strain, PL7, exposed to a 3-h pulsed treatment were determined at 4 h after
invasion as described previously.

In vivo assessment of WLL-vs. All mouse experiments were approved by the 
Stanford Administration Panel on Laboratory Animal Care and strictly followed 
their specific guidelines. Sample size was determined on the basis of the minimum 
number of animals required for good data distribution and statistics. We ensured 
all animals in vehicle- and compound-treated groups had similar body weight 
before treatment. No inclusion or exclusion criteria were used as animals were 
randomly chosen for each group. No blinding was performed between the groups. 
For each drug assay, Balb/c female mice (~20 g, 6–8 weeks old) were infected by
intraperitoneal injection with 1 x 10° P. chabaudi parasites isolated from an infected
mouse on day 0. For the experiment presented in Fig. 4, the infected mice were
dosed 3 days after infection with a single bolus tail vein injection either of vehicle
(n = 6) comprising 10% DMSO, 10% (2-hydroxypropyl)-β-cyclodextrin in sterile
PBS, or of WLL-vs (n = 6) formulated in the vehicle. For the experiment presented
in Extended Data Fig. 4c, the infected mice were dosed 2 days after infection with
a single bolus intraperitoneal injection either of vehicle (45% polyethylene glycol
(relative molecular mass 400), 35% propylene glycol, 10% ethanol, 10% DMSO and
10% (w/v) 2-hydroxypropyl-β-cyclodextrin), or WLL-vs at 40 mg kg⁻¹, 60 mg kg⁻¹,
and 80 mg kg⁻¹ body weight formulated in the vehicle. Both of the treatments were
administered at night (21:00). Parasitaemia was monitored daily by Giemsa-stained
thin blood smears obtained from the tail vein and quantified by light microscope 
counting, where a minimum of five fields, each containing at least 400 red blood 
cells, were counted. The health of the mice was also monitored daily by measuring 
their body weight and by assessment of their appearance and activity. 
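
The counting criteria above (a minimum of five fields, each with at least 400 red blood cells) can be expressed as a short Python sketch. It assumes that parasitaemia is calculated by pooling the counts from all fields; whether counts were pooled or averaged per field is not stated, and the function name and example counts are hypothetical.

```python
MIN_FIELDS = 5             # minimum number of microscope fields
MIN_RBC_PER_FIELD = 400    # minimum red blood cells per field


def parasitaemia_percent(fields):
    """fields: list of (infected_rbc, total_rbc) counts, one pair per field."""
    if len(fields) < MIN_FIELDS:
        raise ValueError("at least five fields are required")
    for infected, total in fields:
        if total < MIN_RBC_PER_FIELD:
            raise ValueError("each field needs at least 400 red blood cells")
        if not 0 <= infected <= total:
            raise ValueError("infected count must lie between 0 and total")
    infected_sum = sum(infected for infected, _ in fields)
    total_sum = sum(total for _, total in fields)
    # Pooled estimate across all fields (assumption).
    return 100.0 * infected_sum / total_sum


# Hypothetical counts from five fields of one smear.
smear = [(20, 400), (20, 400), (20, 400), (20, 400), (20, 400)]
parasitaemia = parasitaemia_percent(smear)
```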
Assessment of P. falciparum cysteine protease activities. A mixed stage culture 
of red blood cells infected with P. falciparum D10 was treated with a range of 
concentrations of WLL-vs for 1 h at 37 °C. BODIPY-TMR-DCG04 (2 μM) was then
added to the culture for a further 1 h for live cell labelling of the activities of the
parasitic cysteine proteases. JPM-OEt (100 μM) was added as a positive control.
After probe labelling, the parasite pellet was isolated by saponin lysis and kept at
−80 °C. For further analysis, the samples were thawed at 4 °C and lysed directly
in sample buffer and loaded onto a 15% SDS-polyacrylamide gel for electrophoresis.
Fluorescence of the probe was detected using the Typhoon as described above. 
Giemsa stain of WLL-vs treated P. falciparum culture. P. falciparum culture at 
mid-ring (9-12h after invasion) was treated with DMSO or WLL-vs for 1h at 37°C 
and the inhibitor was removed. The culture was washed in media and left to grow 
in fresh media for a further 23h. Culture was then spun down and the red blood 
cell pellet used to make a thin blood smear on glass slides, fixed with methanol 
and stained with Giemsa stain for light microscopy. 

Chemical synthesis. The detailed procedures for chemical synthesis are presented 
in Supplementary Information. 

Cryo-EM data acquisition. A sample of P. falciparum 20S proteasome core was 
diluted in 50 mM Tris-HCl, pH 7.5, 5 mM MgCl₂, and 1 mM dithiothreitol, to a
concentration of 0.14 μM (0.1 mg ml⁻¹), and incubated in the presence of 10 μM
WLW-vs for 1 h at 37 °C. The sample was applied onto Quantifoil 1.2/1.3 electron
microscope grids that were coated with a thin layer of carbon, freshly floated from 
mica, and glow-discharged in air. The grids were flash frozen by plunging into 
liquid ethane using a Vitrobot (FEI), operated at 22°C and 95% humidity, with 
a 2.5s blot time. The grids were transferred into a FEI Titan Krios microscope, 
operated at an acceleration voltage of 300 keV and a nominal magnification of 
×75,000 (yielding a calibrated sampling of 1.04 Å per pixel at the image level).
Images were recorded (from a single cryo-EM grid and during a single cryo-EM
session) on a Falcon II direct electron detector, using EPU software, at a range of
1.70–3.16 μm underfocus and 1 s exposure time (Extended Data Fig. 3a). For each
exposure, 17 individual frames were captured at a rate of 0.056 s per frame, each
corresponding to an electron dose of 2.8 electrons per square ångström. For each
exposure, the image corresponding to the sum of all frames was used for screening 
for optimal ice thickness and image stability, as assessed by the isotropic recovery 
of Thon rings in their power spectra. From the 2,603 images recorded, 1,816 were 
selected for further processing. 

Image processing. We used the same strategy for image analysis as for the recent 
determination of the structure of an inhibitor-bound human 20S proteasome 
core. For each exposure selected for image analysis, the sum of all frames captured
was used to determine the defocus and astigmatism parameters, using the Tigris
program findctf, and for particle selection, performed automatically in Relion.
This automated particle selection was carefully supervised with the manual removal 
of false positives and addition of false negatives. Further processing was performed 
using the sum of frames 3–10 of each exposure, effectively acquired within 0.45 s
and corresponding to a total accumulated electron dose of 28 electrons per square
ångström. This frame selection aimed at an adequate signal-to-noise ratio, for accurate
determination of particle orientation, and the preservation of high-resolution
information, as previously discussed. The phases of the resulting images were
corrected for the contrast transfer function by phase flipping with the Tigris program
flipctf, using the image defocus and astigmatism parameters determined as
described above. Selected particle images were then extracted into 256 pixel x 256 
pixel boxes, resulting in a data set of 97,720 particles. The single-particle analysis 
refinement routine was performed using C2 symmetry and used the cryo-EM 
structure of the human 20S proteasome core (Electron Microscopy Data Bank 
accession code EMD-2981) as starting reference. This routine consisted of rounds 
of alignment and Euler angle assignment by projection matching, using the 
program AP SH of the Spider software package, 3D reconstruction, using icr3d
(which includes correction for the amplitude component of the contrast transfer 
function), and forward projection, using icr3dpro. The icr3d and icr3dpro 
programs are implemented in Tigris, a software package publicly available at
http://sourceforge.net. A resolution-limited approach was used to avoid noise over-fitting
during refinement, where the 3D map obtained in each refinement cycle was care- 
fully inspected, masked, and Fourier low-pass filtered to a conservative threshold 
in preparation of the reference for the next cycle. The surface representations of 
the final map as determined by the icr3d software (Fig. 2a, b), without further 
sharpening, Fourier filtering, or masking, were created using UCSF Chimera and
colour coded according to local resolution as determined by ResMap.
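
The frame timing and dose figures quoted in this section can be checked with a short Python sketch. It uses only the stated values (17 frames, 0.056 s per frame, 2.8 electrons per square ångström per frame) and assumes that the accumulated dose of 28 electrons per square ångström is counted from the first frame up to and including frame 10; the function names are hypothetical.

```python
# Acquisition parameters as stated in the text.
FRAME_TIME_S = 0.056        # seconds per frame
DOSE_PER_FRAME = 2.8        # electrons per square angstrom per frame
FRAMES_PER_EXPOSURE = 17


def acquisition_time(n_frames):
    """Time needed to record n_frames consecutive frames."""
    return n_frames * FRAME_TIME_S


def accumulated_dose(last_frame):
    """Dose accumulated from frame 1 up to and including last_frame (assumption)."""
    return last_frame * DOSE_PER_FRAME


full_exposure_s = acquisition_time(FRAMES_PER_EXPOSURE)  # close to the 1 s exposure
subset_frames = 10 - 3 + 1                               # frames 3-10 inclusive
subset_time_s = acquisition_time(subset_frames)          # close to the quoted 0.45 s
dose_by_frame_10 = accumulated_dose(10)                  # the quoted 28 e per square angstrom
```

Under this reading, the eight summed frames themselves carry 8 × 2.8 = 22.4 electrons per square ångström, with the remainder of the accumulated dose delivered during the two discarded frames.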


Molecular modelling. Molecular models for each of the P. falciparum proteasome 
20S core subunits were originally obtained using the Phyre2 Protein Homology 
Recognition Engine, with the sequences deposited in the PlasmoDB (Plasmodium
Genomics Resource). A preliminary fitting of these models was performed using
molecular dynamics flexible fitting, which was followed by real-space refinement
in Coot and Phenix. Protein regions that were not well recovered in our
map were not modelled. The final model was checked for geometry, close contacts,
and bond parameters using MolProbity. The statistics obtained (Extended
Data Fig. 3c) compared favourably with those for models determined by X-ray 
crystallography at similar resolutions, indicating that there was no over-fitting during
model building. The resolution of the cryo-EM map can therefore be assessed
by Fourier shell correlation against a density map generated from these coordinates,
yielding a value of about 3.6 Å (Extended Data Fig. 3d). The mesh representations
of the final map showing the fitted coordinates of the P. falciparum 20S proteasome
core (Fig. 2c–g) were created using the PyMOL Molecular Graphics System. For
these representations, the map was sharpened with a B-factor of −50 and Fourier
low-pass filtered to 3.6 Å.
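
Reading a resolution estimate from a Fourier shell correlation (FSC) curve at a fixed threshold can be sketched in Python as below. This is not the procedure implemented in the software used here; it is a minimal illustration that assumes the curve is sampled at increasing spatial frequencies (in Å⁻¹) and takes the first crossing below the threshold by linear interpolation. The function name and the curve values are hypothetical. For a curve that dips and recovers, the choice of crossing would need separate consideration.

```python
def resolution_at_threshold(spatial_freqs, fsc, threshold=0.5):
    """Resolution (in angstroms) at the first drop of the FSC below threshold.

    spatial_freqs: increasing spatial frequencies in 1/angstrom.
    fsc: correlation values at those frequencies.
    Returns None if the curve never crosses the threshold.
    """
    for i in range(1, len(fsc)):
        c0, c1 = fsc[i - 1], fsc[i]
        if c0 >= threshold > c1:
            f0, f1 = spatial_freqs[i - 1], spatial_freqs[i]
            # Linear interpolation between the two bracketing shells.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None


# Hypothetical FSC curve, not data from this study.
freqs = [0.1, 0.2, 0.3, 0.4]
curve = [1.0, 0.9, 0.6, 0.2]
resolution = resolution_at_threshold(freqs, curve)
```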
Extended Data Figure 1 | Activity, substrate specificity and inhibition
of human and P. falciparum 20S proteasome. a–c, Substrate cleavage
profile of activated human and P. falciparum 20S proteasome. a, Activation
of the human and P. falciparum 20S proteasome by human PA28α. Activity
was determined by cleavage of the fluorogenic substrate Suc-LLVY-amc.
Error bars, s.d.; n = 3 purified proteasome in technical replicates.
b, c, iceLogos of cleavage sequences that are uniquely processed either by
the P. falciparum (b) or by the human proteasome (c). Amino acids that
are most and least favoured at each position are shown above and below
the axis, respectively. Lower-case ‘n’, norleucine; amino acids in black
text are statistically significant (P < 0.05, unpaired two-tailed Student’s
t-test). d, e, Inhibition potencies of the vinyl sulfone inhibitors. d, Table
of IC₅₀ values for each inhibitor in 1-h treated P. falciparum and human
20S proteasome. IC₅₀ values are determined from three independent
experiments of inhibitor pretreatment followed by activity labelling of the
20S proteasome (n = 3 purified proteasome). Gels in Fig. 1f and Extended
Data Fig. 2b were quantified to calculate the IC₅₀ values (for gel source
data and replicates, see Supplementary Fig. 1a, b). Data are mean ± s.d.
e, Table of EC₅₀ values for each of the inhibitors in 1 h and 72 h treatment
of P. falciparum at ring stage or non-confluent HFFs. Data are mean ± s.d.;
n = 6 parasite cultures from two independent experiments of triplicates
for P. falciparum treatments; n = 9 cell cultures from three independent
experiments of triplicates for HFF treatment, except for 1 h WLW-vs, 1 h
LLW-vs, and 72 h LLL-vs, where n = 6 cell cultures from two independent
experiments of triplicates.
Extended Data Figure 2 | Proteasome inhibitors preferentially inhibit
β2 of the P. falciparum proteasome. a, Vinyl sulfone inhibitors are
synthesized from the Boc-protected amino acid by first generating the
Weinreb amide, followed by the Horner–Wadsworth–Emmons reaction
and standard peptide coupling. b, Purified human 20S proteasome was
pre-treated for 1 h at 37 °C with each inhibitor followed by addition of
activity-based probe MV151 (ref. 40) to assess for human proteasome
activities (for gel source data, see Supplementary Fig. 1b). c, HFF or
P. falciparum culture was treated for 1 h at 37 °C with each inhibitor,
followed by compound washout and post-lysis activity-based probe
labelling. Gel shown for WLL-vs in P. falciparum is derived from Fig. 4c
at the indicated concentrations to allow for direct comparison with other
compounds (for gel source data, see Supplementary Fig. 1e).
Extended Data Figure 3 | Evaluation of the final cryo-EM map
and molecular model of the P. falciparum 20S proteasome core.
a–e, Evaluation of the single-particle analysis of the P. falciparum 20S
proteasome core bound to the inhibitor WLW-vs. a, Cryo-EM image of 
the sample analysed, with molecular images of side views of the complex 
(normal to its long axis) indicated by rings. The image greyscale was 
inverted to show the protein densities in white. b, Individual sections of 
the 3D map, as determined by the 3D reconstruction algorithm (without 
further sharpening, masking, or Fourier filtering), are represented as 
grey scale. These sections are 1 Å thick and reveal the quality of the
reconstruction, as the protein densities are clearly resolved against a very
smooth background, with regions showing the pattern of α-helices (box)
and the clear separation of sheet-forming β-strands (arrows) indicated.
c, Evaluation of the model of the P. falciparum 20S proteasome core using
MolProbity. d, Resolution estimate of the cryo-EM map by Fourier shell
correlation. The curves correspond to the correlation obtained against the 
protein model (red) and the correlation between maps determined from 
two halves of the data (blue). The resolution was estimated from the curve
against the model where the 0.5 correlation coefficient criterion yields an
estimate of 3.6 Å. The correlation coefficient can be seen to fall to a local
minimum at ~6 Å and then recover at higher resolutions for both Fourier
shell correlation curves. This behaviour is consistent with the rotationally 
averaged amplitude spectra of both the cryo-EM map and the coordinates 
(e). This region of the amplitude spectra contains reduced structural 
information, typical of protein scattering, indicating that these effects in 
the Fourier shell correlation curves arise from a genuine local reduction
in the signal:noise ratio. f–h, Accessibility of the human 20S proteasome
active sites to the inhibitor WLW-vs, using the protein model of the
human proteasome core complex bound to an LLL-vs inhibitor. Protein
coordinates of the human proteasome 20S core (PDB 5A0Q) β2 (f), β1 (g),
and β5 (h) active sites were aligned to the coordinates of the P. falciparum
proteasome β2-subunit bound to the WLW-vs inhibitor. The model of the
human 20S proteasome active sites is represented as van der Waals surfaces 
with the superimposed WLW-vs inhibitor shown as sticks. 
Extended Data Figure 4 | Intact cell treatment and in vivo treatment of
vinyl sulfone inhibitors. a, WLW-vs was incubated in early trophozoite 
P. falciparum culture for 3 h, washed out, and the parasite lysate was 
incubated with probe BMV037. Top gel, the fluorescent scan; bottom gel, 
the silver stain. For gel source data, see Supplementary Fig. 1f. b, Body 
weight of WLL-vs- and vehicle-treated Balb/c mice after compound 
treatment by tail vein injection (Fig. 4e), expressed as a percentage of the 
original body weight on day 3 before compound treatment. Body weight 
of vehicle-treated mice decreased after day 6 of infection as part of the 
response to the natural resolution of the P. chabaudi infection. Treatment
day is indicated by arrow; n = 6 mice for each group; error bars, s.d.
c, Balb/c female mice infected with 1 x 10° P. chabaudi parasites from
passage host on day 0 were treated with a single bolus dose of vehicle (45%
polyethylene glycol (relative molecular mass 400), 35% propylene glycol,
10% ethanol, 10% DMSO, and 10% (w/v) 2-hydroxypropyl-β-cyclodextrin;
n = 4 mice) or WLL-vs at 40 mg kg⁻¹ (n = 5 mice), 60 mg kg⁻¹ (n = 4 mice),
and 80 mg kg⁻¹ (n = 3 mice) formulated in the vehicle. Treatment
was performed on day 2 after infection as indicated by the arrow and
administered by intraperitoneal injection. Parasitaemia was monitored
daily by Giemsa stain of thin blood smears. Error bars, s.d.
Extended Data Figure 5 | Assessing off-target activities of WLL-vs.
a, Structures of WLL-vs and its diastereomer WL(d)L-vs. b, Dose-response
curves of WLL-vs and WL(d)L-vs after 72 h treatment in P. falciparum.
Error bars, s.d. (n = 6 parasite cultures for WLL-vs from triplicates of two
independent experiments, and n = 8 parasite cultures for WL(d)L-vs over
three independent experiments). c, Purified P. falciparum 20S proteasome
was treated for 1 h at 37 °C with 10 μM of WLL-vs and WL(d)L-vs (left)
or a range of concentrations of WL(d)L-vs. Residual activity was assessed
by probe BMV037 (for gel source data, see Supplementary Fig. 1g).


d, A mixed-stage culture of P. falciparum was treated for 1 h with WLL-vs
at 37 °C, followed by BODIPY-TMR-DCG04 for a further 1 h. Samples
were directly loaded onto SDS-polyacrylamide gel electrophoresis
for analysis. JPM-OEt (100 μM) was included as positive control. The
fluorescent scan is shown at the top and the Coomassie stain is shown at
the bottom. For gel source data, see Supplementary Fig. 1h. e, Giemsa stain
of 1 h treated P. falciparum ring 24 h after inhibitor was added. Scale bar,
600 μm.
Backbone NMR reveals allosteric signal transduction
networks in the β₁-adrenergic receptor


Shin Isogai, Xavier Deupi, Christian Opitz, Franziska M. Heydenreich, Ching-Ju Tsai, Florian Brueckner,
Gebhard F. X. Schertler, Dmitry B. Veprintsev & Stephan Grzesiek


G protein-coupled receptors (GPCRs) are physiologically important 
transmembrane signalling proteins that trigger intracellular 
responses upon binding of extracellular ligands. Despite recent 
breakthroughs in GPCR crystallography, the details of ligand-induced
signal transduction are not well understood owing to
missing dynamical information. In principle, such information can
be provided by NMR, but so far only limited data of functional
relevance on few side-chain sites of eukaryotic GPCRs have been
obtained. Here we show that receptor motions can be followed
at virtually any backbone site in a thermostabilized mutant of
the turkey β₁-adrenergic receptor (β₁AR). Labelling with
[¹⁵N]valine in a eukaryotic expression system provides over twenty
resolved resonances that report on structure and dynamics in six 
ligand complexes and the apo form. The response to the various 
ligands is heterogeneous in the vicinity of the binding pocket, but 
gets transformed into a homogeneous readout at the intracellular 
side of helix 5 (TM5), which correlates linearly with ligand 
efficacy for the G protein pathway. The effect of several pertinent, 
thermostabilizing point mutations was assessed by reverting them to 
the native sequence. Whereas the response to ligands remains largely 
unchanged, binding of the G protein mimetic nanobody NB80 
and G protein activation are only observed when two conserved 
tyrosines (Y227 and Y343) are restored. Binding of NB80 leads to 
very strong spectral changes throughout the receptor, including 
the extracellular ligand entrance pocket. This indicates that even 
the fully thermostabilized receptor undergoes activating motions 
in TM5, but that the fully active state is only reached in the presence
of Y227 and Y343 by stabilization with a G protein-like partner. 
The combined analysis of chemical shift changes from the point 
mutations and ligand responses identifies crucial connections in the 
allosteric activation pathway, and presents a general experimental 
method to delineate signal transmission networks at high resolution 
in GPCRs. 

A thermostabilized, detergent-resistant mutant of β₁AR
(TS-β₁AR; see Methods) was selectively labelled with [¹⁵N]valine and
produced in insect cells without further chemical modifications. Its 
28 valine residues are homogeneously distributed across the receptor 
(Extended Data Fig. 1) at locations suitable to sense ligand binding 
and receptor activation. Although resonances of main chain atoms 
are considerably more difficult to observe than those of mobile side 
chains of surface residues, they are expected to be better reporters of 
functional, long-range backbone motions. We succeeded in obtaining
well-resolved TROSY (transverse relaxation-optimized spectroscopy)
spectra of the valine ¹H–¹⁵N backbone resonances of detergent-solubilized
TS-β₁AR in its apo form and in complexes with six ligands
(Extended Data Fig. 1) ranging in their efficacy from antagonists to
agonists (Extended Data Table 1). Despite the absence of deuteration
and very short T₂ relaxation times (~4 ms for ¹H), 26 valine resonances
could be observed with sufficient sensitivity and resolution.


Distinct and reversible chemical shift changes were detected for 
many valines after ligand exchange. Sixteen valines were assigned unambiguously
and five tentatively using spectra from 18 point mutants, as
well as further spectral and structural information (Extended Data 
Table 2). 

Many valine residues in the vicinity of the ligand binding pocket 
could be assigned, showing chemical shift changes that report on the 
ligand functional groups (Fig. 1). Remarkably, residue V172(4.56) 
(the number in parentheses corresponds to the Ballesteros–Weinstein
numbering system), which is located close to the ligand aromatic
head group, exhibits an unusual ¹⁵N chemical shift of ~105–110 p.p.m.
(Extended Data Figs 1, 2 and Fig. 1c). This anomaly seems to be caused by a
distorted backbone geometry, which is presumably conserved among
adrenergic receptors and results from a missing hydrogen bond to the
proline at position 176(4.60) (Extended Data Fig. 2). Instead, the carbonyl
of V172(4.56) participates in a water-mediated hydrogen bond
network, which connects the ligand binding site, TM3, TM4, TM5, and
TM6. Seemingly as a result of these interactions, the V172(4.56)
¹H–¹⁵N resonances cluster according to the substitution patterns of
the ligand head group (Fig. 1c): one cluster is observed for the partial 
agonists/antagonists cyanopindolol, alprenolol and carvedilol, which 
have larger head groups with ortho- and/or meta-substitutions; a sec- 
ond cluster is observed for the agonists isoprenaline and dobutamine, 
which bear a meta- and para-substituted catechol ring. We attribute 
the distinct chemical shifts for isoprenaline or dobutamine to the loss 
of a coordinated water caused by specific hydrogen bond interactions 
between their catechol moieties and the side chain of S215(5.461)
(Extended Data Fig. 2d). 

Remarkably, the resonance positions of V172(4.56) in complex 
with the antagonist atenolol strongly differ from the already described 
complexes and the apo form: considerable ¹HN (>0.4 p.p.m.) and ¹⁵N
(>4 p.p.m.) upfield shifts indicate, respectively, a further weakening
of the main chain hydrogen bond V172(4.56)HN···I168(4.52)O and a
stronger kink of the backbone. This rearrangement seems to be caused by
the insertion of the para-acetamide group of the ligand head between
residues S215(5.461) and V172(4.56) (Extended Data Fig. 2d). This
will lead to a substantial disruption of the TM3–TM4–TM5 interface,
thereby precluding receptor activation, in agreement with atenolol’s
inverse agonist pharmacology. Thus, the amide chemical shifts
of V172(4.56) constitute a very sensitive readout for the state of this 
water-mediated, inter-helical activation switch. 

The ¹H–¹⁵N chemical shifts of further residues in the vicinity of
the binding pocket report on additional characteristics of the ligands. 
V314(6.59) and V202(ECL2) are located at the extracellular surface 
of the receptor in a “vestibule” next to the entry/exit pathway of the 
orthosteric binding site. The resonances of these residues are either
severely broadened or undetectable in the absence of ligands, whereas 
they are observable in the presence of ligands (Fig. 1d and Extended 
Data Fig. 3). A line shape analysis for V314(6.59) (Extended Data Fig. 4) 
Figure 1 | Ligand-induced ¹H–¹⁵N chemical shift changes in the
vicinity of the ligand binding pocket of β₁AR. a, Partial view of the
β₁AR–carvedilol crystal structure (4AMJ) showing valine residues (blue
spheres) in the vicinity (<8.5 Å) of the ligand (magenta sticks) binding
site. b, Chemical structures of the β₁AR ligands used in this study. Ligand
affinities derived from whole-cell binding assays on the thermostabilized
β36-m23 β₁AR construct are indicated as pKD values. Similar pKD values
were measured for the TS-β₁AR construct (Extended Data Table 1). c, Left,
ligand-induced response of V172(4.56) ¹H–¹⁵N resonances. The black bar
represents a scale of 0.1 p.p.m. (¹H) and 1 p.p.m. (¹⁵N). The labels o, m, p
indicate ligands with ortho, meta, and para substitutions at the head group,
respectively. c, Right, partial view of the β₁AR–carvedilol structure (4AMJ)
showing the interaction network connecting V172(4.56) to S215(5.461),
P219(5.50), I129(3.40), and F299(6.44). d, Left, representation as c, left, for
the ¹H–¹⁵N resonances of V314(6.59). Centres of resonances are indicated


indicates that this extracellular part of the receptor undergoes micro-
to millisecond motions in the apo form, which are quenched by ligand
binding. This is consistent with results on β2AR, which suggest that
high-affinity ligands stabilize the conformation of ECL2 and ECL3°.
The ligand-induced shifts of the V314 resonance correlate strongly
(r² = 0.95) with the reported ligand affinity’” (Fig. 1d). Interestingly,
the resonances of V314(6.59) in the apo form and in high-affinity
ligand complexes are very close. This may indicate that the high-
affinity ligand complexes mimic the average apo conformation.
Finally, the ¹H-¹⁵N chemical shifts of V125(3.36) at the bottom of the
binding site and of V103(2.65) close to the ligand tail reveal additional
trends (Fig. 1e): the chemical shifts of V125(3.36) correlate with the
depth of ligand insertion towards the central part of TM3 (r² = 0.81),
whereas those of V103 correlate with the volume of the ligand tail
(r² = 0.90).
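The correlations quoted above are between a ligand property and a best-fit linear combination of the two backbone shifts, a·δ¹H + b·δ¹⁵N + c. The following is a minimal sketch of how such a fit and its r² could be computed by ordinary least squares. The function name and the example numbers are hypothetical and are not data from this study; the fitting procedure actually used by the authors is not described in this section.

```python
def fit_shift_combination(shifts_1h, shifts_15n, observable):
    """Least-squares fit of observable ~ a*d1H + b*d15N + c.

    Returns (a, b, c, r2). The data are centred first so that the
    2x2 normal equations stay well conditioned.
    """
    n = len(observable)
    mean_h = sum(shifts_1h) / n
    mean_n = sum(shifts_15n) / n
    mean_y = sum(observable) / n
    dh = [h - mean_h for h in shifts_1h]
    dn = [v - mean_n for v in shifts_15n]
    dy = [y - mean_y for y in observable]

    # Sums of squares and cross products of the centred data
    shh = sum(x * x for x in dh)
    snn = sum(x * x for x in dn)
    shn = sum(x * z for x, z in zip(dh, dn))
    shy = sum(x * y for x, y in zip(dh, dy))
    sny = sum(z * y for z, y in zip(dn, dy))

    det = shh * snn - shn * shn  # zero only if the two shifts are collinear
    a = (shy * snn - sny * shn) / det
    b = (sny * shh - shy * shn) / det
    c = mean_y - a * mean_h - b * mean_n

    ss_res = sum((y - (a * h + b * v + c)) ** 2
                 for h, v, y in zip(shifts_1h, shifts_15n, observable))
    ss_tot = sum(y * y for y in dy)
    return a, b, c, 1.0 - ss_res / ss_tot


# Made-up shifts (p.p.m.) and pKD values for illustration only.
example_1h = [8.10, 8.18, 8.21, 8.33, 8.30, 8.41]
example_15n = [120.1, 121.2, 120.8, 122.6, 123.4, 124.0]
example_pkd = [5.9, 6.4, 7.1, 8.0, 8.6, 9.3]
a, b, c, r2 = fit_shift_combination(example_1h, example_15n, example_pkd)
print(f"pKD ~ {a:.2f}*d1H + {b:.2f}*d15N + {c:.1f} (r2 = {r2:.2f})")
```

With only a handful of ligands and three fitted parameters, r² values from such a fit should be read with the small sample size in mind.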

Compared to inactive β2AR, complexes of activated β2AR with either
G protein’ or the G protein-mimicking nanobody NB80*' show large
movements at the intracellular sides of TM5, TM6 and their intervening
loop ICL3, which form the binding site for the G protein. These
conformational changes are expected to be conserved throughout the
GPCR family’’. Four valine residues could be assigned in this region
of TS-β1AR: V226(5.57), V230(5.61), V280(6.25), and V298(6.43)
(Fig. 2). In contrast to the chemical shift changes in the vicinity of the
ligand binding pocket, which depend strongly on the ligand chemistry,
the shifts of the TM5 residues observed in this region report on ligand


by circles. d, Middle, correlation of a best-fit linear combination of the
V314(6.59) chemical shifts (−49.3 δ¹H + 2.02 δ¹⁵N + 133) to the ligand
affinity pKD (Extended Data Table 1). d, Right, partial view of the
carvedilol (red, 4AMJ), dobutamine (orange, 2Y01), cyanopindolol
(green, 2VT4 B), and isoprenaline (blue, 2Y03) complex structures
showing the ligand-induced movement of V314(6.59). e, Left,
representation as d, left, for the ¹H-¹⁵N resonances of V125(3.36) and
V103(2.65). e, Middle, correlations of best-fit linear combinations of
chemical shifts for V125(3.36) (−0.402 δ¹H + 1.17 δ¹⁵N − 132) and
V103(2.65) (−336 δ¹H − 634 δ¹⁵N + 7.64·10⁴) to the ligand insertion
depths and tail volumes (Extended Data Table 1), respectively.
e, Right, partial view of the carvedilol, dobutamine, cyanopindolol, and
isoprenaline crystal structures (representation as d, right) showing the
ligand-induced movement of V125(3.36), D121(3.32), and V103(2.65).


efficacy. This effect is most prominent for residue V226(5.57), for which 
the ¹H-¹⁵N resonances fall on one line from antagonists to agonists
(Fig. 2a). The chemical shifts for the different ligands correlate very
strongly (r² = 0.89) with their reported’’ efficacies for Gs signalling
(Extended Data Table 1). This highly linear effect suggests that the 
receptor filters the diverse input signals from the various ligands to 
a unified and precise structural response on TM5, which can be read 
out by the chemical shifts of V226(5.57). Interestingly, the V226(5.57) 
atenolol peak is situated at a position corresponding to lower efficacy 
than for the apo receptor. This gives direct structural evidence of aten- 
olol’s inverse agonist action, which reduces the activation relative to the 
basal level of the apo receptor. 

Current high-resolution structures of β1AR do not show significant
changes between antagonist- and agonist-bound forms (Fig. 2b). The
decrease of the V226(5.57) ¹HN chemical shift by about 0.2 p.p.m.
from the agonist isoprenaline to the antagonist atenolol indicates a
lengthening of the V226(5.57)-HN···I222(5.53)-O hydrogen bond by
about 0.05 Å (ref. 20). This small, but clearly NMR-detectable length
variation is below the resolution limit of current GPCR structures
(Supplementary Text 1), but may indicate the start of TM5 bending
towards the active conformation as observed in the G protein-bound
form of β2AR (Fig. 2b). Remarkably, this response to agonists occurs
even in the thermostabilized receptor TS-β1AR. Albeit reduced in
absolute size compared to V226(5.57), V230(5.61) displays similar
linear chemical shift changes as a function of ligand efficacy for the
Figure 2 | Correlation of ligand-induced chemical shift changes at the
TS-β1AR intracellular side with Gs efficacy. a, Left, response of the
V226(5.57) ¹H-¹⁵N resonance to various ligands (colour coding as in Fig. 1).
The centres of resonances are indicated by circles. The ¹H-¹⁵N resonances
fall on one line from atenolol (antagonist) over apo to alprenolol (partial
agonist), carvedilol (antagonist), cyanopindolol (partial agonist), dobutamine
(full agonist) and isoprenaline (full agonist). a, Right, correlation of a
best-fit linear combination of the V226(5.57) chemical shifts (−515 δ¹H
− 31.7 δ¹⁵N + 8.41·10³) in different ligand complexes to their efficacy for
the Gs signalling pathway’”. b, Left, overlay of TM5 and TM6 backbones of
thermostabilized β1AR in antagonist- (blue, PDB code 4AMJ) and agonist-
bound (green, PDB code 2Y03) form. The agonist does not induce detectable
helix movements. b, Right, TM5 and TM6 backbone movements upon
activation in human β2AR. The overlay of inactive (blue, PDB code 2RH1)
and G protein-bound β2AR (magenta, PDB code 3SN6) structures shows the
large bend of TM6 along with the smaller conformational change of TM5
upon activation. Hydrogen bonds 5.57-HN···5.53-O and 5.61-HN···5.57-O are
indicated by dashes. According to the behaviour of the ¹HN chemical shifts
of V226(5.57) and V230(5.61) in TS-β1AR, these hydrogen bonds expand
in an efficacy-dependent manner during agonist binding. c, Response of the
¹H-¹⁵N resonances for V226(5.57), V230(5.61), V298(6.43), and V280(6.25)
to various ligands. For clarity, only the centres of resonances are depicted
as circles with colour coding as in Fig. 1. The black bar represents a scale
of 0.1 p.p.m. and 1 p.p.m. for the ¹H and ¹⁵N chemical shifts, respectively.
The schematic representation of the receptor indicates the locations of the
respective valine residues at the cytoplasmic sides of TM5 and TM6 within
the helical bundle of β1AR.


Gs pathway (Fig. 2c). As this residue is located one helical turn further
towards the cytoplasm, the detected conformational change is not just 
local, but spans a certain length in TM5. 




Compared to V226(5.57) and V230(5.61) in TM5, the chemical
shift response to ligands is much less pronounced for V298(6.43) and
in particular V280(6.25) at the intracellular side of TM6 (Fig. 2c).
This suggests that agonist binding to the TS-β1AR does not induce
the large conformational change in TM6 observed in the activated
β2AR-G protein? or β2AR-NB80'8 complexes. However, G protein
activation upon agonist binding has been reported for other less
thermostabilized β1AR constructs!’, indicating that they can still be
activated, albeit at low levels. Thus we reverted the mutations most
likely to interfere with the activation mechanism in TS-β1AR, that is,
I129(3.40)V in the connector switch!®, Y227(5.58)A in TM5*!4, and
Y343(7.53)L in the NPxxY motif of TM7!*”? to the native residues.
These reverse mutants were then tested for G protein activation and
the NMR response in TM6. A summary of the results is given in
Extended Data Table 3. The original TS-β1AR, the single mutants
TS-β1AR(V129I), TS-β1AR(A227Y), and TS-β1AR(L343Y) as well
as the double mutant TS-β1AR(V129I/A227Y) showed no detectable
G protein activation upon isoprenaline binding (Extended Data
Fig. 5). However, G protein activation was detectable for the least
thermostable TS-β1AR(A227Y/L343Y) double mutant (Tm reduced
by 11°C relative to TS-β1AR), which recovers the conserved tyrosines
in TM5 and TM7 that are known to stabilize the active state of
rhodopsin”’. None of the reverse mutants showed major changes in the
NMR spectra of various ligand complexes compared to the original
TS-β1AR (Extended Data Fig. 6). In particular, residues V298(6.43)
and V280(6.25) at the intracellular side of TM6 did not show an
increased response to agonists. This is in agreement with recent DEER
(double electron-electron resonance) and ¹⁹F-NMR data showing that
agonists alone do not fully stabilize the active state of TM6 in β2AR’.
Interestingly, the V129(3.40)I and the V129(3.40)I/A227(5.58)Y
mutations shifted the ¹H-¹⁵N resonances of V226(5.57) towards
a more active (that is, bent) state of TM5 in both the atenolol- and
isoprenaline-bound forms (Extended Data Fig. 6b), thereby giving
direct experimental evidence for an allosteric activation pathway
spanning about 13 Å from I129(3.40) on TM3 to V226(5.57) on TM5.

With the exception of rhodopsin’, the stabilization of fully active
GPCR conformations seems to require binding of an agonist and
an intracellular partner®?. Indeed, when both the agonist isoprenaline
and the G protein-mimicking nanobody NB80!* were added to
TS-β1AR(A227Y/L343Y), very large chemical shift responses for many
valine residues in TM3-TM6 were observed, whereas no change was
observed for several valines in TM1, 2, and 7 (Extended Data Fig. 7).
This very strong response extends even to the extracellular residue
V314(6.59), providing evidence of a long-distance connection from
the G protein binding site to the ligand entry site. The strong chemical
shift changes are reverted when the partial agonist/antagonist
cyanopindolol is added to the isoprenaline-TS-β1AR(A227Y/L343Y)-NB80
complex. The spectrum then becomes identical to that of the ‘pure’
cyanopindolol-TS-β1AR(A227Y/L343Y) complex (Extended Data
Fig. 7a), indicating that cyanopindolol replaces isoprenaline and
causes the release of NB80. In agreement with the G protein activation
data, the isoprenaline-bound original TS-β1AR and the mutants
TS-β1AR(A227Y) and TS-β1AR(L343Y) did not show binding of NB80
in the NMR spectra. Moreover, supplementing NB80 to the ultrastable
TS-β1AR did not change its affinity for isoprenaline (Extended Data
Fig. 7b), whereas it caused a hundred-fold affinity increase in the case
of TS-β1AR(A227Y/L343Y) and the truncated native turkey β1AR
receptor (tβtrunc)!”. This increase is identical to data for β2AR" and
shows the energetic coupling between the NB80 and agonist binding
also for β1AR.

In combination, these data prove that agonist binding, even in the 
absence of a G protein mimic, induces initial changes in the conforma- 
tional equilibrium of TM5 towards the conformation observed in the 
G protein complex of β2AR. Remarkably, these rearrangements occur
in all thermostabilized forms of β1AR. However, a full shift of the
equilibrium towards such an active conformation, including allosteric 
Figure 3 | Experimental detection of allosteric signalling pathways
using the NMR response to ligand binding and point mutations at
different backbone sites. a, Combined ¹H, ¹⁵N chemical shift deviations
[$\Delta\delta = (\Delta\delta_{^{1}\mathrm{H}}^{2}/2 + \Delta\delta_{^{15}\mathrm{N}}^{2}/50)^{1/2}$] of valine resonances observed upon
ligand binding or induced by the indicated point mutations. For ligand
binding, the three pairwise deviations Δδ were calculated between the
apo, atenolol-, and isoprenaline-bound forms of TS-β1AR. The maximum
of these deviations is shown. For the reverse mutants, deviations Δδ are
shown relative to TS-β1AR for their apo (black), atenolol- (cyan), and
isoprenaline-bound (blue) forms. Valines within 10 Å from the
Cα atom of the mutated amino acid are shown on a grey background.
Distances were calculated using the coordinates of the thermostabilized
β1AR (PDB code 4BVN). A red line marks a cut-off value Δδ of 0.05 p.p.m.
for significant chemical shift deviations. b, Topology of the signalling


changes at the extracellular side, occurs only when G protein or its 
mimetic NB80 is bound. This process requires the presence of both 
Y227(5.58) and Y343(7.53), which significantly reduce the thermal sta- 
bility. Different active conformations may be reached for non-G protein 
effectors such as β-arrestin.

The possibility to detect NMR signals at many receptor sites in 
response to ligand binding and point mutations provides an experimen- 
tal method to trace allosteric signalling paths. Figure 3 shows examples 
of these pathways, derived from the response to the ligands atenolol and 
isoprenaline and the single point mutations V129(3.40)I, A227(5.58)Y, 
and L343(7.53)Y. With a cutoff of 0.05 p.p.m. for the resulting
combined ¹H, ¹⁵N chemical shift change (Fig. 3a, red line), long-range
(>10 Å) connections become evident throughout the receptor
(Fig. 3b, c). Whereas detected ligand signals radiate broadly to almost 
all helices, the point mutants give evidence of smaller interaction net- 
works connecting TM3 to TM4/5, TM5 to TM3/4/6, as well as TM7 
to TM2/3 (Fig. 3b, c). Interestingly, the TM2/TM7 network seems to 
be only weakly connected to the TM3-TM6 network. Together, these 
data provide experimental evidence at high resolution of an extensive 
signal transduction network that connects the ligand binding site to the 
intracellular sides of TM5, TM6, and TM7. Such a network of loosely 
coupled allosteric connections has been postulated previously for β2AR
on the basis of molecular dynamics simulations”.
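The cutoff-based selection of responding valines can be sketched as follows. The sketch assumes that the combined deviation is the weighted root of squares Δδ = (Δδ²(¹H)/2 + Δδ²(¹⁵N)/50)^(1/2), as read from the Fig. 3 legend; the function names and the example shift changes are hypothetical and are not values from this study.

```python
import math


def combined_shift_deviation(delta_1h, delta_15n):
    """Combined 1H/15N deviation in p.p.m. (assumed weighting: 1/2 and 1/50)."""
    return math.sqrt(delta_1h ** 2 / 2.0 + delta_15n ** 2 / 50.0)


def significant_valines(shift_changes, cutoff=0.05):
    """Return residue labels whose combined deviation exceeds the cutoff.

    shift_changes maps a residue label to (delta_1H, delta_15N) in p.p.m.
    """
    return sorted(
        label
        for label, (d_h, d_n) in shift_changes.items()
        if combined_shift_deviation(d_h, d_n) > cutoff
    )


# Hypothetical shift changes between two receptor states, for illustration only.
example_changes = {
    "V226(5.57)": (0.20, 0.80),
    "V280(6.25)": (0.01, 0.05),
    "V314(6.59)": (0.03, 0.40),
}
print(significant_valines(example_changes))
```

Residues passing the cutoff would then be classified as local or long-range according to their distance from the perturbation site, which requires structural coordinates and is not shown here.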

In summary, we have shown that highly resolved solution NMR 
backbone spectra can be obtained for a eukaryotic GPCR. The NMR 
observations delineate the allosteric signalling pathways and com- 
prehensively connect many previous experimental and theoretical 
observations, which may ultimately allow the dynamic mechanisms of
GPCRs to be understood at the atomic level.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
network determined from point mutations (left) and ligand binding
(right). Signal paths were identified by chemical shift deviations Δδ larger
than 0.05 p.p.m. induced by these two perturbations (a). Signal paths to
valines within 10 Å from the ligand or point mutation (that is, localized
conformational changes) are indicated as dashed lines, and those beyond
10 Å (long-range conformational changes) as solid lines. The ligand
signals broadly towards all helices but TM1. In contrast, the network
determined by the point mutations is more localized and connects TM3
to TM4/5, TM5 to TM3/4/6, and TM7 to TM2/3. The latter network seems
to be divided into two subnetworks involving TM3/4/5/6 and TM2/7.
c, Long-range allosteric signal paths identified from ligand binding or
point mutations (a and b) indicated on schematic β1AR representations
showing the involved TMs. Helices are colour-coded according to b.
METHODS

β1AR constructs. The TS-β1AR mutant was derived from the turkey β1AR44-m23
mutant used in crystallographic studies” by adding three additional thermostabilizing
mutations (I129V, D322K, and Y343L) and a neutral mutation (D200E) from the
ultra-stable β1AR-JM3 mutant!!. As compared to the wild type, TS-β1AR contains
truncations at the amino and carboxy termini and intracellular loop (ICL3), a total
of nine thermostabilizing point mutations, three further point mutations as well as a
C-terminal hexahistidine tag (Extended Data Fig. 1). The final TS-β1AR sequence
is MGAELLSQQWEAGMSLLMALVVLLIVAGNVLVIAAIGSTQRLQTLTN 
LFITSLACADLVVGLLVVPFGATLVVRGTWLWGSFLCELWTSLDVLCVTASVE
TLCVIAIDRYLAITSPFRYQSLMTRARAKVIICTVWAISALVSFLPIMMHWWR
DEDPQALKCYQDPGCCEFVTNRAYAIASSIISFYIPLLIMIFVALRVYREAKEQ
IRKIDRASKRKTSRVMLMREHKALKTLGIIMGVFTLCWLPFFLVNIVNVFNR 
DLVPKWLFVAFNWLGYANSAMNPIILCRSPDFRKAFKRLLAFPRKADRRLH 
HHHHH. 

Additional valine-to-alanine or isoleucine point mutations were introduced
into TS-β1AR for NMR assignment purposes. All constructs were made using the
QuikChange site-directed mutagenesis method (Agilent). Baculovirus for insect 
cell expression was generated using the Bac-to-Bac system (Invitrogen). 
β1AR expression and purification. All β1AR constructs were expressed in
baculovirus-infected insect cells as described”®. Selective labelling by [¹⁵N]valine
was achieved by growing cells on unlabelled serum-free insect cell medium
(InsectXpress, Lonza) and then exchanging into custom-made serum-free medium
(SF4, BioConcept) devoid of valine and yeast extract, to which 100 mg l⁻¹ [¹⁵N]valine
was added. Virus was added immediately after the medium
exchange. The culture was harvested at 48 or 72 h post infection.

After cell lysis, the membrane fraction was separated from the lysate via
ultracentrifugation and subsequently solubilized with 2% n-decyl-β-D-maltopyranoside
(DM, Anatrace). The solubilized membrane fraction was then purified by nickel
ion affinity chromatography followed by alprenolol ligand affinity chromatography.
The active receptor was eluted with buffer (20 mM Tris, 350 mM NaCl, 0.1% DM,
pH 7.5) containing either atenolol (1 mM) or alprenolol (0.1 mM). Final yields
of detergent-solubilized receptor were 1.5 mg l⁻¹ of cell culture. The molecular
weight of the receptor-detergent complex was estimated as ~100 kDa by static
light scattering. 

Thermal shift assays of mutant receptors. Detergent-solubilized, purified 
apo TS-β1AR and reverse-mutation receptors for thermal stability assays were
obtained from their atenolol-bound form by washing with buffer devoid of
ligand on a HiTrap SP HP (GE Healthcare) column. Their thermal stability was
determined by the microscale fluorescent stability assay for binding of the
thiol-specific fluorochrome N-[4-(7-diethylamino-4-methyl-3-coumarinyl)phenyl]maleimide
(CPM)?’ in a Rotor-Gene Q (QIAGEN) real-time PCR cycler using
1 µg of receptor in 20 mM Tris, 350 mM NaCl, 0.1% DM, pH 7.5 and a heating
rate of 2 K min⁻¹.

NMR experiments. NMR samples were prepared in Shigemi tubes as 250 µl
volumes of typically 100 µM receptor, 1 mM ligand (except for apo form), 20 mM
Tris, 100 mM NaCl, 0.1% DM, 5% D₂O, pH 7.5. For isoprenaline or dobutamine,
2 mM sodium L-ascorbate was added as anti-oxidant. All solution NMR
measurements were carried out at 304 K on an 800 MHz or a 900 MHz Bruker
Avance III spectrometer equipped with a cryogenic probe. 2D ¹H,¹⁵N TROSY
spectra were recorded with total acquisition periods of 16 ms (¹⁵N) and 43 ms
(¹H) with typical total experimental times of 24-48 h. As compared to a standard
TROSY pulse sequence, the ¹H-¹⁵N INEPT delays were set to 3 ms to reduce
magnetization losses from relaxation.

Assignment procedure. To obtain sequence-specific assignment information, we
initially attempted to detect HNCO and HNCA correlations on samples additionally
labelled with ¹³C at specific backbone sites**. However, due to low sensitivity,
only very few correlations were observable. Therefore, assignments were obtained
from a combination of information from TROSY spectra recorded on 18 TS-β1AR
valine point mutants with different ligands, four HN(CO) correlations and five
distinct structure-based chemical shift predictions (Extended Data Table 2).
Ligand exchange experiments. Receptor complexes with different ligands were 
generated by sequential exchange according to increasing ligand affinity, that 
is, in the sequence atenolol-isoprenaline-dobutamine-alprenolol or alprenolol- 
carvedilol-cyanopindolol. For exchange, the sample was washed three times with 
buffer devoid of ligand at tenfold dilution in Amicon Ultra 50 kDa cutoff concen- 
trators. Subsequently, the sample was washed again twice with buffer containing 
100 µM new ligand, separated by a period of 1 h incubation. Final concentrations of
the ligands were adjusted to 1 mM. Apo receptor was generated from the atenolol
complex by six washing steps of tenfold dilution in ligand-free buffer using a 1 h
incubation period for the last three steps.
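As a rough guide to what these washing schemes achieve, the residual concentration of free ligand after repeated tenfold dilutions can be estimated with the short sketch below. It assumes ideal mixing and considers only free ligand, ignoring receptor-bound ligand and slow dissociation; the function name is hypothetical, and the 1 mM starting value is the nominal ligand concentration of the NMR samples.

```python
def residual_concentration(initial_molar, n_washes, dilution_factor=10.0):
    """Free-ligand concentration (M) left after n ideal dilution steps."""
    return initial_molar / dilution_factor ** n_washes


# Three tenfold washes, as used before adding a new ligand.
after_three = residual_concentration(1e-3, 3)
# Six tenfold washes, as used to generate the apo receptor.
after_six = residual_concentration(1e-3, 6)
print(f"{after_three:.1e} M after three washes, {after_six:.1e} M after six")
```

Under these assumptions a 1 mM ligand is diluted to about 1 µM after three washes and about 1 nM after six, which is one way to rationalize the longer incubation periods used for the final apo washing steps.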

NMR NB80 binding experiment. Binding of NB80 to β1AR mutants was assessed
using TROSY and 1D proton NMR spectra. These spectra were recorded on the
β1AR mutants (TS-β1AR: 132 µM, TS-β1AR(A227Y): 120 µM, TS-β1AR(L343Y):
110 µM, and TS-β1AR(A227Y/L343Y): 120 µM) in the presence of saturating
amounts (1 mM) of the agonist isoprenaline before and immediately after addition
of an equimolar (relative to the receptor) amount of NB80. For TS-β1AR(A227Y/L343Y)
additional spectra were recorded after a further addition of the partial
agonist cyanopindolol (1 mM) to the already present isoprenaline and NB80.
Scintillation proximity assay with ³H-dihydroalprenolol. For pharmacological
binding assays membranes were prepared from SF9 insect cells as described
previously”®. The total protein content of the membranes was estimated by A280
measurements using an average extinction coefficient of 1.0 per mg ml⁻¹. All
assays were carried out in 96-well plates at 200 µg ml⁻¹ total protein in membranes
and 2 mg ml⁻¹ WGA-YSi beads (Perkin-Elmer) in a 100 µl total volume per well.
Samples were equilibrated at room temperature for at least 16 h. KD values for the
radioactive ligand [³H]dihydroalprenolol (³H-DHA) were determined by titrating
³H-DHA from 0.032 to 100 nM. Non-specific binding was determined in the presence
of 1 µM S-propranolol to block the ligand binding site. Competition assays were
performed in the presence of 20 nM ³H-DHA (hot ligand) and increasing
concentrations of the competitor (cold ligand). Dilutions of alprenolol, atenolol,
cyanopindolol, dobutamine and isoprenaline were made with phosphate buffered
saline (PBS, Biochrom, Germany). Due to the limited solubility of carvedilol in
water, stock dilutions of carvedilol were prepared in DMSO. The final concentration
of DMSO in the samples was 5%. To test the effect of NB80 binding on
the affinities (IC50) of isoprenaline for various receptor mutants, the competition
assays were also carried out in the presence of saturating concentrations of
NB80 (10 µM). IC50 values were determined by fitting the measured radioactive
counts per minute CPM(X) at a specific concentration X of the competitor
to the equation

$$\mathrm{CPM}(X) = \frac{\mathrm{CPM}_{\max} - \mathrm{CPM}_{\min}}{1 + X/\mathrm{IC}_{50}} + \mathrm{CPM}_{\min},$$

where CPMmax and CPMmin are maximal and minimal counts of the assay, respectively. The fits were
carried out in MATLAB (MathWorks, http://www.mathworks.com) with Monte-Carlo
estimation of errors. Ki values were calculated from the obtained IC50
values according to the formula

$$K_i = \frac{\mathrm{IC}_{50}}{1 + A/K_D},$$

where A is the concentration of the
radioactive ligand and KD is its affinity for the receptor determined in the direct
binding experiment.
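The competition analysis can be illustrated with a short sketch that assumes the one-site model CPM(X) = (CPMmax − CPMmin)/(1 + X/IC50) + CPMmin and the conversion Ki = IC50/(1 + A/KD). The original fits were carried out in MATLAB with Monte-Carlo error estimation; the Python version below replaces this with a plain grid search over log IC50, omits error estimation, and uses hypothetical function names and example values.

```python
import math


def competition_model(x, ic50, cpm_max, cpm_min):
    """One-site competition curve: counts at competitor concentration x."""
    return (cpm_max - cpm_min) / (1.0 + x / ic50) + cpm_min


def fit_ic50(concentrations, counts, grid_points=2001):
    """Grid search over log10(IC50); amplitude and offset by linear regression.

    Returns (ic50, cpm_max, cpm_min). Concentrations and IC50 share one unit.
    """
    positive = [x for x in concentrations if x > 0]
    log_lo = math.log10(min(positive)) - 1.0
    log_hi = math.log10(max(positive)) + 1.0
    n = len(counts)
    mean_y = sum(counts) / n
    best = None
    for i in range(grid_points):
        ic50 = 10.0 ** (log_lo + (log_hi - log_lo) * i / (grid_points - 1))
        frac = [1.0 / (1.0 + x / ic50) for x in concentrations]
        mean_f = sum(frac) / n
        sff = sum((f - mean_f) ** 2 for f in frac)
        sfy = sum((f - mean_f) * (y - mean_y) for f, y in zip(frac, counts))
        amplitude = sfy / sff              # estimate of CPMmax - CPMmin
        cpm_min = mean_y - amplitude * mean_f
        sse = sum((amplitude * f + cpm_min - y) ** 2
                  for f, y in zip(frac, counts))
        if best is None or sse < best[0]:
            best = (sse, ic50, amplitude + cpm_min, cpm_min)
    return best[1], best[2], best[3]


def cheng_prusoff_ki(ic50, radioligand_conc, radioligand_kd):
    """Convert IC50 to Ki given radioligand concentration A and its KD."""
    return ic50 / (1.0 + radioligand_conc / radioligand_kd)


# Synthetic, noise-free example (concentrations in M); not data from this study.
example_x = [1e-9 * 10 ** (k / 2.0) for k in range(11)]
example_cpm = [competition_model(x, 1e-7, 5000.0, 300.0) for x in example_x]
ic50, cpm_max, cpm_min = fit_ic50(example_x, example_cpm)
# 20 nM radioligand as in the assay; the 5 nM KD is a placeholder value.
print(f"IC50 ~ {ic50:.2e} M, Ki ~ {cheng_prusoff_ki(ic50, 20e-9, 5e-9):.2e} M")
```

The grid search is used only to keep the example dependency-free; a nonlinear least-squares routine with resampling of the data would be closer to the procedure described in the text.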

G protein activation assay. G protein activation was measured on purified β1AR
mutants reconstituted with MSP1E3D1” into POPC/POPG nanodiscs. MSP1E3D1
was expressed and purified as described” and cleaved with TEV protease.
1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC, Avanti Polar Lipids) and
1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-(1'-rac-glycerol) sodium salt (POPG,
Avanti Polar Lipids) were solubilized at a ratio of 1:1.5 (w/w) POPG/POPC in ND
buffer (20 mM HEPES pH 8, 100 mM NaCl, 1 mM EDTA) with 50 mM sodium
cholate (Sigma-Aldrich) at 4°C. 133.3 µM MSP1E3D1 was incubated with 8 mM
solubilized POPC/POPG and 10 µM purified β1AR in ND buffer with a final
concentration of 24 mM sodium cholate for 1 h at 4°C. Nanodiscs containing the
receptor were separated from empty nanodiscs using a cobalt-chelating resin. The
heterotrimeric G protein was prepared by incubating 10 µM recombinant Gαi1
and 10 µM native Gβγ in activation buffer (25 mM HEPES pH 7.5, 150 mM NaCl,
2 mM MgCl₂, 1 mM DTT) for 30 min at 4°C.

G protein activation was detected by the change in tryptophan fluorescence
caused by the exchange of GDP for GTPγS, associated conformational changes
in the Gα subunit and its dissociation from the Gβγ subunit of the heterotrimeric
G protein*”. All measurements were carried out on a Varian Cary Eclipse
fluorescence spectrophotometer (λex = 295 nm, λem = 340 nm, 1.5 nm excitation
slit, 20 nm emission slit, 2 s averaging time, 15 s cycle time) using final sample
volumes of 1 ml in 10 x 4 mm cuvettes (Hellma, CH) and magnetic stirrers at
20°C. Prior to activation, the fluorescence intensity baseline was recorded with
100 nM heterotrimeric G protein for approximately 500 s. The activation was
started by adding 6 nM β1AR and 10 µM GTPγS, and the fluorescence intensity
was monitored for a further 1 h. For experiments in the presence of an agonist,
the concentrated receptor stock solution (1.5 µM) was pre-incubated for 30 min at
4°C with 40 µM isoprenaline, and the buffer during the measurements contained
2 µM of isoprenaline to maintain the saturation conditions for the receptor.
Extended Data Figure 1 | Position of valine residues in the
thermostabilized mutant TS-β1AR. a, Schematic representation of
secondary structure and amino acid sequence of TS-β1AR. As compared to
the wild type, the TS-β1AR has truncations at the N and C termini and the
intracellular loop (ICL3) as well as nine thermostabilizing point mutations
(colour coded green) and three additional point mutations (colour
coded yellow). Valines labelled with ¹⁵N are indicated by bold circles for
assigned (blue) and unassigned (grey) residues. b, Structure of β1AR in
complex with carvedilol (PDB code 4AMJ). The protein backbone and
carvedilol are shown in ribbon and red stick representation, respectively.
The individual valines are depicted as spheres (blue, assigned; grey, not
assigned) labelled by residue number. c, Full ¹H-¹⁵N TROSY spectra of
apo TS-β1AR and all investigated ligand complexes. The ligand chemical
structures are shown as inserts. Resonances are marked with assignment
information (black, firm; cyan, tentative).
Extended Data Figure 2 | Effect of ligand head group substitution on
hydrogen bond network involving V172(4.56) in β1AR. a, Sequence
alignment of turkey β1AR and human adrenoreceptors around position
4.56 in TM4 showing the conserved proline at position 4.60. b, Schematic
representation of the hydrogen bond network between TM4 and TM5
involving V172(4.56) in β1AR. The hydrogen bond network (orange
dotted lines) originates at the carbonyl of V172(4.56) and connects to
TM5 via two water molecules (w1 and w2, red asterisks). c, Left, phi
and psi dihedral angles (averages and standard deviations in case of
multiple chains) in TM4 between residues 4.48 and 4.61 for different
complexes of β1AR (ligand, PDB code [chain]: carvedilol, 4AMJ [A,B];
cyanopindolol, 2VT4 [B,D] and 4BVN [A]; dobutamine, 2Y00 [A,B] and
2Y01 [A,B]; isoprenaline, 2Y03 [A,B]). V165(4.49) and V172(4.56), for
which ¹H-¹⁵N resonances could be observed, are shown on a green and
red background, respectively. c, Right, average of the ¹H-¹⁵N resonance
positions for V165(4.49, green) and V172(4.56, red). The phi and psi
values of V172(4.56) are distorted from the normal helical angles due to
the loss of the intra-helical hydrogen bond to P176(4.60). Together with
the loss of the canonical hydrogen bond, this strongly shifts both the
¹H and ¹⁵N chemical shifts of V172(4.56) towards smaller p.p.m. values
relative to V165(4.49), which has normal, helical phi and psi angles.
d, Partial views of the crystal structures of the carvedilol complex (PDB
code 4AMJ), the isoprenaline complex (PDB code 2Y03) and a docking
model of the atenolol complex based on the cyanopindolol complex
crystal structure (PDB code 4BVN). In the carvedilol complex, one water
molecule (w2) forms a hydrogen bond network between the carbonyl
oxygen of V172(4.56) and the side chains of S211(5.43) and S215(5.461).
In the isoprenaline complex, the side chains of S211(5.43) and S215(5.461)
are rotated and form hydrogen bonds to the catechol hydroxyl groups
of isoprenaline. In this structure, no water molecule is observed at the
equivalent position of w2. In the atenolol complex, the para-substituted
acetamide of the ligand head ring (dashed circle) inserts between
S211(5.43) and S215(5.461) and disturbs the interface between TM3 and
TM5 near V172(4.56) (solid circle).
Extended Data Figure 3 | Response to various ligands for all assigned
valine ¹H-¹⁵N resonances in TS-β1AR. Colour coding as in Extended
Data Fig. 1: cyan (atenolol), blue (isoprenaline), orange (dobutamine),
purple (alprenolol), red (carvedilol), green (cyanopindolol), black (apo
receptor). For clarity the centres of resonances are marked by circles.
Firmly (tentatively) assigned residues are marked in black (cyan).
Extended Data Figure 4 | Evidence for micro- to millisecond dynamics
at the ligand entry/exit pathway. Left, region of the ¹H-¹⁵N TROSYs
showing the V314(6.59) ¹H-¹⁵N resonance of TS-β1AR in the apo and
various ligand-bound forms. The resonances of the apo and atenolol-
bound forms are severely broadened in the ¹⁵N dimension indicative of
backbone dynamics in the micro- to millisecond range. The broadening is
not observed for the other more tightly binding ligands. The resonances
for the low affinity ligands, atenolol, isoprenaline and dobutamine, and
for the high affinity ligands, alprenolol, carvedilol and cyanopindolol
cluster at different positions. Interestingly, the resonance of the apo form
clusters with the high affinity ligands, indicative of a similar backbone
conformation. Right, ¹⁵N and ¹H line widths of the V314(6.59) resonance
for the apo and ligand-bound forms of TS-β1AR.
Extended Data Figure 5 | Isoprenaline-induced G protein activation by β1AR mutants. Activation was measured by the change in tryptophan fluorescence upon exchange of GDP for GTPγS in the Gα subunit of the heterotrimeric Gi protein. The figure shows the time courses of tryptophan fluorescence after the addition of different mutant forms of TS-β1AR in nanodiscs and GTPγS to the heterotrimeric G protein in the presence (red) and absence (black) of the agonist isoprenaline. The increase in fluorescence intensity at time t = 0 is caused by the additional fluorescence of the receptor, the nanodisc scaffold protein and the ligand. For the active mutant TS-β1AR(A227Y/L343Y), this initial rise is followed by an exponential fluorescence increase due to the activation and dissociation of the G protein. The rate of the Gαi activation by TS-β1AR(A227Y/L343Y) (half-life ~290 s at 6 nM receptor) is approximately 4 times slower than for the activation by rhodopsin (half-life ~400 s at 1 nM receptor) using the same assay. This assumes that the KM of Gαi for TS-β1AR(A227Y/L343Y) is of the same scale as for rhodopsin (8.6 nM) and therefore significantly below the (saturating) concentration of Gαi (100 nM). Each mutant was measured three times, except for TS-β1AR(A227Y/L343Y), which was measured four times. Representative fluorescence curves from single experiments are shown.
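The "approximately 4 times slower" comparison in the legend can be checked with a short calculation. The sketch below is not the authors' analysis: it assumes first-order kinetics and, in line with the saturating Gαi concentration stated in the legend, that the observed rate scales linearly with receptor concentration. The function and variable names are illustrative.

```python
import math


def per_receptor_rate(half_life_s: float, receptor_nM: float) -> float:
    """Observed first-order rate constant (1/s) divided by receptor concentration (nM).

    Assumes the observed rate is proportional to receptor concentration,
    which is only reasonable when the G protein is saturating.
    """
    rate_constant = math.log(2) / half_life_s
    return rate_constant / receptor_nM


# Values quoted in the legend.
ts_b1ar_mutant = per_receptor_rate(half_life_s=290.0, receptor_nM=6.0)
rhodopsin = per_receptor_rate(half_life_s=400.0, receptor_nM=1.0)

fold_slower = rhodopsin / ts_b1ar_mutant
print(f"Mutant receptor is about {fold_slower:.2f}-fold slower per receptor than rhodopsin")
```

Under these assumptions the ratio reduces to (290 × 6)/400, which is a little over 4 and agrees with the figure quoted in the legend.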
Extended Data Figure 6 | Observed NMR effects of the reverse mutations of TS-β1AR towards the native β1AR sequence. a, ¹H–¹⁵N TROSY spectra of TS-β1AR and several reverse single and double mutants in complex with either atenolol (cyan) or isoprenaline (dark blue). Resonances are marked with assignment information (black, definite; cyan, tentative). b, Enlarged regions of the ¹H–¹⁵N correlation spectra showing only the resonance positions of V226(5.57) and V280(6.25) for all mutants in both ligand–receptor complexes. The black bar represents 0.1 p.p.m. in ¹H and 1 p.p.m. in ¹⁵N. For all mutants, the resonances for V226(5.57) show efficacy-related chemical shift changes between atenolol (diamonds) and isoprenaline (circles). For the A227Y mutants TS-β1AR(A227Y) (green), TS-β1AR(V129I/A227Y) (orange) and TS-β1AR(A227Y/L343Y) (cyan), the ¹H–¹⁵N resonances of V226(5.57) also exhibit an overall shift due to a ring current effect from the introduced Y227(5.58) side chain, which has no structural significance. However, the V129I mutants TS-β1AR(V129I) (red) and TS-β1AR(V129I/A227Y) (orange) exhibit further ¹H–¹⁵N shifts towards a more active (that is, bent) state of TM5 relative to the mutants that carry the V129(3.40) residue. For all mutants, the V280(6.25) resonances fall in essentially identical positions and show no changes between atenolol and isoprenaline.
Extended Data Figure 7 | Binding of NB80 to agonist-activated TS-β1AR(A227Y/L343Y). a, Sequential addition of isoprenaline (1 mM, blue), NB80 (120 µM, red), and cyanopindolol (1 mM, green) to TS-β1AR(A227Y/L343Y) (120 µM) followed by ¹H–¹⁵N TROSY spectra. Addition of an equimolar amount of NB80 changes the spectrum drastically. Except for a few resonances (62, 89, 102, 103, 172, 280 and 326), the assignments are not transferable from the isoprenaline complex. The addition of cyanopindolol to this mixture apparently replaces isoprenaline in the receptor and drives the spectrum to the purely cyanopindolol-bound form. These results indicate that TS-β1AR(A227Y/L343Y) can be activated, but that the fully active state requires stabilization with a G protein-like partner at the intracellular site. b, Changes in isoprenaline affinity of β1AR induced by NB80 binding. The isoprenaline affinity was assayed for TS-β1AR (orange), TS-β1AR(A227Y/L343Y) (green), and the truncated native turkey β1AR receptor (tβtrunc, blue)¹⁷ by radio-ligand competition using ³H-dihydroalprenolol in the presence of saturating amounts of NB80 (10 µM, squares) or its absence (circles). For TS-β1AR(A227Y/L343Y) and tβtrunc the apparent affinity increases by two log units in the presence of NB80, whereas no change occurs for TS-β1AR. Data points are shown as mean and standard deviation of three independent experiments for each condition. c, Combined ¹H and ¹⁵N chemical shift changes ($\Delta\delta = (\Delta\delta_{\mathrm{H}}^{2}/2 + \Delta\delta_{\mathrm{N}}^{2}/50)^{1/2}$) between the isoprenaline–TS-β1AR(A227Y/L343Y) and the NB80–isoprenaline–TS-β1AR(A227Y/L343Y) complexes as a function of residue number. Except for the few assigned peaks in the NB80–isoprenaline–TS-β1AR(A227Y/L343Y) complex, chemical shift deviations are calculated as the minimal difference from the isoprenaline peak position to the nearest peak position in the isoprenaline–NB80 complex. Note that these minimal chemical shift deviations present a lower boundary for the deviations that could be obtained from a full assignment of the isoprenaline–NB80 complex. Peak deviations are colour-coded in red (>0.2 p.p.m.), orange (0.2–0.05 p.p.m.), blue (<0.05 p.p.m.), and grey (ambiguity from peak overlap). d, Representation of the chemical shift response to nanobody binding given in c on the structure of β1AR. Valines in TM1 and TM7 do not show a large chemical shift response. Due to overlap the response for V280(6.25) in TM6 is unclear. Valines in TM2 show a strong response around the sodium binding region near residue D87(2.50), but valines at its extracellular side are not responsive. In general, valine resonances at the extracellular sides of TM3–TM6 are severely shifted. This indicates that the NB80 binding at the intracellular side has a long-range effect on the extracellular side near the ligand binding site.
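The minimal-deviation procedure described for panel c can be sketched in a few lines. This is an illustration of the stated rule only, not the authors' script: the peak positions are hypothetical, the function names are invented here, and placing values exactly on a bin edge in the orange bin is an assumption.

```python
import math


def combined_shift(delta_h: float, delta_n: float) -> float:
    """Combined 1H/15N shift change in p.p.m.: sqrt(dH^2 / 2 + dN^2 / 50)."""
    return math.sqrt(delta_h ** 2 / 2 + delta_n ** 2 / 50)


def minimal_deviation(peak, candidates):
    """Smallest combined shift from one (1H, 15N) peak to any candidate peak.

    Used when the second spectrum is unassigned, so the result is only a
    lower bound on the true shift change.
    """
    h, n = peak
    return min(combined_shift(h - ch, n - cn) for ch, cn in candidates)


def colour_code(deviation: float) -> str:
    """Colour bins from the legend: red >0.2, orange 0.2-0.05, blue <0.05 p.p.m."""
    if deviation > 0.2:
        return "red"
    if deviation >= 0.05:
        return "orange"
    return "blue"


# Hypothetical peak positions (1H p.p.m., 15N p.p.m.), for illustration only.
isoprenaline_peak = (8.00, 120.0)
isoprenaline_nb80_peaks = [(8.10, 120.5), (8.60, 123.0)]

deviation = minimal_deviation(isoprenaline_peak, isoprenaline_nb80_peaks)
print(f"{deviation:.3f} p.p.m. -> {colour_code(deviation)}")
```

Because the nearest peak is chosen without knowing whether it belongs to the same residue, the value can only underestimate the real shift, which is why the legend calls it a lower boundary.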
Extended Data Table 1 | Pharmacological and geometrical properties of the β1AR ligands used in this study

Columns pKD to Gs efficacy describe pharmacology; insertion depth and tail volume describe geometry.

| Ligand | pKD, β36-m23* | pKi, TS-β1AR† | pIC50, TS-β1AR‡ | Gs efficacy, tβtrunc§ | Insertion depth‖ | Tail volume¶ |
|---|---|---|---|---|---|---|
| cyanopindolol | 10.01 ± 0.11 | >8.4 ± 0.3# | >8.1 ± 0.1# | 39 ± 1 | 7.22 | 90.11 |
| carvedilol | 8.72 ± 0.09 | >8.3 ± 0.3# | >7.9 ± 0.1# | 12 ± 0.4 | 7.62 | 163.47 |
| alprenolol | 7.35 ± 0.07 | 7.6 ± 0.3 | 7.2 ± 0.1 | 31 ± 2 | 7.25 | 73.87 |
| dobutamine | 5.49 ± 0.03 | 5.0 ± 0.3 | 4.6 ± 0.1 | 112 ± 7 | 8.46 | 170.34 |
| isoprenaline | 5.16 ± 0.05 | 4.9 ± 0.3 | 4.6 ± 0.2 | 100 | 7.92 | 73.87 |
| atenolol | 4.29 ± 0.11 | 5.0 ± 0.3 | 4.6 ± 0.2 | 2.1 ± 1 | ND** | 73.87 |

*Values for binding affinity correspond to the β36-m23 construct, which contains C- and N-terminal deletions and six thermostabilizing mutations¹⁵. The pKD = −log KD [M] was obtained from [³H]-CGP12177 whole-cell binding assays in stably expressed CHO cells¹⁷.

†pKi values were obtained from the pIC50 values given in the pIC50 column by adding log(1 + [lig]/KD) = 0.366 as a correction for the concentration of the [³H]dihydroalprenolol (³H-DHA) ([lig] = 20 nM) and its determined dissociation constant (KD = 15.147 nM, see Methods). Errors are obtained by error propagation from the errors of the pIC50 values and KD.

‡pIC50 values were determined by the radioligand inhibition assay described in the Methods section using ³H-DHA. Data are given as mean and standard deviation of three independent experiments.

§Values for efficacy towards Gs-mediated signalling as determined by adenylyl cyclase activation of the truncated native turkey β1AR receptor (tβtrunc)¹⁷. The efficacies are indicated as the percentage of the (maximal) isoprenaline response obtained from ³H-cAMP accumulation assays in CHO cells.

‖The insertion depth (Å) of the ligand was taken as the distance between the β-carbon atom of the ligand amino group and the amide nitrogen atom of V125 (V117 for β2AR) in the crystal structures of turkey β1AR in complexes with isoprenaline (PDB ID: 2Y03), dobutamine (PDB ID: 2Y00), carvedilol (PDB ID: 4AMJ), and cyanopindolol (PDB ID: 4BVN) as well as of human β2AR in complex with alprenolol (PDB ID: 3NYA).

¶The tail volumes (Å³) were calculated by the Molinspiration Property Calculation Service (http://www.molinspiration.com/cgi-bin/properties) for the tail group including the amino moiety.

#The pKi and pIC50 are limited by the KD and concentration of the radioactive tracer ligand as well as by the concentration of the receptor in the competition assay.

**Not determined, no crystal structure available.
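The pIC50-to-pKi correction described in the table footnotes has the form of the standard Cheng–Prusoff relation, and the quoted value of 0.366 can be reproduced directly. The sketch below uses only the tracer concentration and dissociation constant given in the footnote; the function name and the choice of example ligands are illustrative.

```python
import math


def pki_from_pic50(pic50: float, tracer_nM: float = 20.0, kd_nM: float = 15.147) -> float:
    """pKi = pIC50 + log10(1 + [lig]/KD).

    Defaults are the tracer concentration and KD quoted in the footnote.
    """
    return pic50 + math.log10(1.0 + tracer_nM / kd_nM)


correction = math.log10(1.0 + 20.0 / 15.147)
print(f"correction = {correction:.3f}")

# pIC50 values as listed in the table (rounded to one decimal place there).
for ligand, pic50 in [("alprenolol", 7.2), ("dobutamine", 4.6), ("carvedilol", 7.9)]:
    print(ligand, round(pki_from_pic50(pic50), 1))
```

Because the tabulated pIC50 values are already rounded, a recomputed pKi can differ from the published one in the last digit for some ligands.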
Extended Data Table 2 | Sequence-specific assignment of ¹H–¹⁵N valine resonances

| Assigned residue | Position | Point mutation | Spectra with ligands | Further information |
|---|---|---|---|---|
| V62 | 1.53 | V62A | Ate, Apo, Alp | |
| V90 | 2.53 | V90A | Ate, Alp, Car, Cya | HN(CO) |
| V94 | 2.56 | V94A | Alp, Car, Cya | |
| V102 | 2.64 | V102A | Ate, Apo, Alp, Iso, Dob, Car, Cya | |
| V122 | 3.33 | V122A | Alp, Car, Cya | |
| V125 | 3.36 | V125A/I | Alp(A), Car(A), Ate(I), Apo(I) | |
| V129 | 3.40 | V129I | Ate | |
| V134 | 3.45 | V134A | Ate | |
| V160 | 4.44 | V160A | Ate | |
| V172 | 4.56 | V172A | Ate, Alp | Pred |
| V202 | ECL | V202A | Alp | Pred |
| V226 | 5.57 | V226A | Alp, Car, Cya | |
| V230 | 5.61 | V230A | Alp | |
| V280 | 6.25 | V280A | Alp, Car, Cya | |
| V298 | 6.43 | V298A | Ate | HN(CO) |
| V314 | 6.59 | V314A | Alp, Car, Cya | |
| V326 | 7.36 | V326A | Alp | |
| V89 | 2.52 | - | Ate, Apo, Alp, Iso, Dob, Car, Cya | HB, NM |
| V95 | 2.57 | - | Ate, Apo, Alp, Iso, Dob, Car, Cya | HB, NM, HN(CO) |
| V103 | 2.65 | - | Ate, Apo, Alp, Iso, Dob, Car, Cya | HB, NM, HN(CO) |
| V165 | 4.49 | - | Ate, Apo, Alp, Iso, Dob, Car, Cya | NM |

Ate, atenolol; Alp, alprenolol; Iso, isoprenaline; Dob, dobutamine; Car, carvedilol; Cya, cyanopindolol

HN(CO): information from 2D-filtered HN(CO) of [¹⁵N,¹³C]Gly,Val-, [¹⁵N]Leu-labelled receptor

Pred: ShiftX2 ¹⁵N chemical shift prediction²¹

HB: expected ¹HN shift based on hydrogen bond length analysis

NM: peak shift due to mutation of neighbouring residue

Assignments for V89(2.52), V95(2.57), V103(2.65), V122(3.33), and V165(4.49) are tentative. V51(1.42), V52(1.43), V56(1.47), V60(1.51), V309(6.54), V312(6.57), and V320(ECL) have not been assigned.
Extended Data Table 3 | Summary of the NMR response to ligands, G protein activation and NB80 binding data for the original TS-β1AR construct and various reverse mutants towards the native β1AR sequence

| Construct | Residue 129 | Residue 227 | Residue 343 | V226 response* | V280 response* | G protein activation† | NB80 binding‡ | Tm (°C)§ |
|---|---|---|---|---|---|---|---|---|
| original TS-β1AR | V | A | L | + | - | | (4.55/4.52) | 58.9 ± 0.6 |
| TS-β1AR(V129I) | **I**‖ | A | L | + | - | - | N.D.¶ | 58.6 ± 0.2 |
| TS-β1AR(A227Y) | V | **Y** | L | + | - | - | - | 54.9 ± 0.1 |
| TS-β1AR(L343Y) | V | A | **Y** | + | - | - | - | 54.7 ± 0.3 |
| TS-β1AR(V129I/A227Y) | **I** | **Y** | L | + | - | - | N.D. | 54.4 ± 0.1 |
| TS-β1AR(A227Y/L343Y) | V | **Y** | **Y** | + | - | + | + (4.36/6.32) | 47.8 ± 0.2 |

*Determined from the shifts of the ¹H–¹⁵N resonances in response to binding of atenolol and isoprenaline (see Extended Data Fig. 6b).

†Determined from the Gi activation assay described in Extended Data Fig. 5.

‡Determined from NMR experiments by mixing of isoprenaline-activated TS-β1AR with NB80. The ¹H–¹⁵N spectra are shown for TS-β1AR(A227Y/L343Y) in Extended Data Fig. 7. The spectra of the other TS-β1AR mutants showed no response to addition of NB80 up to equimolar concentrations of ~100 µM for both proteins. Values in parentheses show pIC50 for isoprenaline binding in the absence or presence of NB80, respectively (see Extended Data Fig. 7).

§Melting temperature Tm of the apo form receptor determined by the CPM thermal shift assay.

‖Native residues are highlighted in bold.

¶Not determined.
CORRECTIONS & AMENDMENTS

CORRIGENDUM
doi:10.1038/nature16134

Corrigendum: The ‘obligate diploid’ Candida albicans forms mating-competent haploids

Meleah A. Hickman, Guisheng Zeng, Anja Forche, Matthew P. Hirakawa, Darren Abbey, Benjamin D. Harrison, Yan-Ming Wang, Ching-hua Su, Richard J. Bennett, Yue Wang & Judith Berman

Nature 494, 55-59 (2013); doi:10.1038/nature11865


In this Article, there were two errors in allele assignments. In the legend and colour key to Fig. 1b, the allele identity (‘a’ and ‘b’) is inadvertently reversed. Figure 1 of this Corrigendum shows the corrected panel b and legend for the original Fig. 1. A similar error is present in the colour key to Supplementary Fig. 5 (its legend is correct). The Supplementary Information to this Corrigendum shows the corrected Supplementary Fig. 5.
Figure 1 | This figure shows the corrected panel b of the original Fig. 1. SNP/CGH array analysis of indicated strains (left) showing copy number (log2 ratio, black) and SNP allele information (grey, heterozygous; cyan, allele ‘a’; magenta, allele ‘b’; white, no SNP data).


Supplementary Information is available in the online version of the Corrigendum. 
CORRECTIONS & AMENDMENTS

CORRIGENDUM
doi:10.1038/nature16179

Corrigendum: Human body epigenome maps reveal noncanonical DNA methylation variation

Matthew D. Schultz, Yupeng He, John W. Whitaker, Manoj Hariharan, Eran A. Mukamel, Danny Leung, Nisha Rajagopal, Joseph R. Nery, Mark A. Urich, Huaming Chen, Shin Lin, Yiing Lin, Inkyung Jung, Anthony D. Schmitt, Siddarth Selvaraj, Bing Ren, Terrence J. Sejnowski, Wei Wang & Joseph R. Ecker

Nature 523, 212-216 (2015); doi:10.1038/nature14465


The y-axis label in Fig. 3f of this Letter should read ‘[mCAC/CAC]/ 
from patients with bipolar disorder
CAREERS

DIVERSITY ADVOCATE Astrophysicist sends message to US Supreme Court p.245
FACULTY MEMBERS Tips for effective mentoring go.nature.com/6g4bpp
NATUREJOBS For the latest career listings and advice www.naturejobs.com


COLUMN
A bridge to business

PhD holders should not underestimate their value to industry and the business sector, says Peter Fiske.

The glossy poster at my university career planning and placement centre both intrigued and perplexed me: “PhDs: come learn about a career in management consulting – recruiting reception tonight!” As a PhD student in geology, I was dimly aware of the name of the consulting firm that had organized the event; one of my friends had accepted a position with the firm after he graduated. I had heard that he was earning a great salary and travelling a lot, and that he was aiming to go back for a master’s degree in business administration (MBA). Until I saw the poster, I had no idea that a strategic-management-consulting firm would even consider hiring non-MBA types, let alone recruit PhDs specifically.

I applied for a position and landed a series of interviews that culminated in a day of them at the company’s offices in Los Angeles, California. I did not receive an offer in the end, but it was an illuminating experience. Mostly, I was surprised that someone with a freshly minted PhD could immediately earn US$160,000. And that was in 1994.

Fast forward 22 years. Now, as the chief executive of a US technology company, I have recruited PhDs for both technical and business positions. I have found across all scientific disciplines that those with doctoral degrees possess many of the skills that are in highest demand in today’s economy. If you have earned a PhD, you know, for example, how to analyse data. You also understand how to examine those results to gain insights. In some important ways, you are better prepared than MBA holders to make valuable contributions to the business world. You have learned resilience in the face of uncertainty and with limited resources.

Yet you and many other PhD graduates, along with the programmes that trained you, remain largely unaware of or uninterested in opportunities outside academia. In turn, only a few companies, such as the consulting firm whose recruiting poster I saw 20 years ago, have pre-emptively recognized the value that you and your colleagues can bring, and they are reaping a harvest of talent as a result.

You are doing yourself a disservice. As a doctoral degree-holder, you need to appreciate your degree programme for the transferable skills that it confers (see ‘Top transferable skills for business’), and recognize that those skills provide you with significant and immediate advantages. You do not need a business degree or substantial extra training to secure satisfying and highly compensated work in the business sector.

Need more convincing? Let’s examine the 
differences between the training components 
of each degree programme. 

In a typical MBA scheme, students listen to
lectures, work on projects and learn about busi- 
ness-friendly topics such as economics, finance, 
accounting, organizational behaviour and law. 
Fundamentally, the MBA is a conventional, 
curriculum-led educational experience. 
Although internships and student-led organi- 
zations provide opportunities for direct work 
experience, success in business school is largely 
a function of earning good grades. Students 
graduate after two years or so, and top perform- 
ers can land jobs that start at $150,000 or more. 

Most doctoral programmes also require a 
large body of course work. But as a PhD student, 
you probably spent more time ‘doing’ rather 
than studying or passively listening. You con- 
ducted original scientific research, which may 
have included fieldwork, you taught younger 
students and you were possibly involved with 
at least one manuscript. 


DATA DELUGE 

Science PhD students particularly benefit from 
near-constant immersion in emerging tech- 
nologies and, especially, in data analysis and 
hypothesis testing. This is a huge advantage in 
the business world: every industrial sector and
type of business is more reliant on technology
and data now than ever before. And in today’s 
economy, the volume and complexity of the 
information that businesses use require man- 
agers and executives to understand the basics 
of data analysis and statistics — and to have a 
deeper ability to construct and test hypotheses 
and to build and validate models. 

And science-PhD holders have a further 
advantage. Not only is technological innova- 
tion a key to the development of products and 
services, but the tools that are used to run busi- 
nesses have science and technology embedded 
in them. Customer-relations management 
software, for example, uses advanced statisti- 
cal algorithms that analyse trends and identify 
important variables such as user behaviour and 
pricing trends. 

MBA graduates do not routinely encounter 
such training or confront these requirements 
in their degree programme, whereas they are 
an integral component of the PhD programme. 

In short, the PhD experience is much more 
like the real world of business than is the 
MBA programme. If educational institutions 
were to train MBA students in the same way 
that they train PhD students, they would 
require business students to complete all the 
required course work — and to launch a suc- 
cessful business. 

THE TALENT WITHIN

Top transferable skills for business

If you have earned a science PhD, you were probably told by mentors, advisers and career-development specialists that you will need to develop a lot of new skills to succeed in any sector outside academia. But your PhD programme has already conferred many skills that are important, even crucial, in the business world, and that are comparable to, and in some cases superior to, the talents acquired in a graduate-level business programme. Here are some examples.

• Data analysis You were trained to gather, evaluate, synthesize and present data, and to uncover relationships, correlations and trends. The business world increasingly relies on the same methodologies to develop strategies and identify opportunities.

• Resourcefulness You probably had to create experiments, methodologies and analyses with limited resources and under tight time constraints. Successful business people are often challenged to develop a product or service while facing the same difficulties.

• Technological awareness You were trained to understand the fundamentals of a range of technologies. Many of these technologies are at the heart of products and services in the private sector.

• Resilience You may have encountered unexpected setbacks in your research or studies, yet powered through to reach your goals. This resilience in the face of challenge often separates the most successful entrepreneurs from the rest.

• Project management Completing a PhD typically requires the coordination and scheduling of disparate resources and individuals, as well as thinking through all aspects of a complex project or activity. The same course of action is a core component of the business world.

• Problem-solving You had to use novel thinking and innovative frames of reference to identify and solve technical problems. The ability to reframe problems to identify novel solutions is a key skill in business.

• English proficiency You are probably skilled in English, the most prevalent language of international business.

• Written communication PhD holders often have extensive experience in writing and describing complex ideas and methodologies. Effective written communication is crucial to business success. P.F.

244 | NATURE | VOL 530 | 11 FEBRUARY 2016

© 2016 Macmillan Publishers Limited. All rights reserved

Given that the PhD is such a powerful training foundation for success in the private sector, one might presume that this value would be reflected in the salaries of PhD graduates as compared to those of their MBA counterparts. Unfortunately, the opposite is true.

For example, a PhD-trained biochemist with five years’ work experience after their PhD who is employed as a ‘PhD biochemist I’ (a typical job title in industry) in the United States has an average annual salary of roughly $68,000, according to salary.com, a major salary database. In contrast, a person with the title ‘market-research supervisor III’, who also has five years’ work experience and an MBA, is likely to earn $91,000.

Both jobs require an ability to analyse data, 
synthesize technical information, perform 
tests to validate hypotheses and supervise 
a team. But whereas the market-research 
supervisor almost certainly could not walk 
into the PhD biochemist’s position, the bio- 
chemist could easily perform all the duties of 
the research supervisor — with just a small 
amount of tailored training. 


BRIDGE THE GAP 

But therein lies the glitch. If a PhD provides
graduates with such a range of strengths and 
transferable skills, why don't doctorate hold- 
ers immediately command higher salaries? 
Part of the problem is that PhD students do 
not routinely receive such tailored training in 
the skills that are highly valued in the business 
sector. Practical professional skills such 
as negotiation, communication, business 
strategy, basic economics and marketing are 
all abundantly represented in the curricula of 
leading MBA programmes. 

Yet only a handful of research universi- 
ties train PhD students in the basics of busi- 
ness. Some institutions, such as Princeton 
University in New Jersey, have begun to pro- 
vide professional-development curricula to 
those in PhD programmes. It is likely that 
graduates from these places will face better 
professional and economic prospects than 
those without such training, because they are 
better prepared to compete for a broad range 
of professional opportunities. 

There may be a disincentive for science and 
engineering education programmes (and the 
agencies that fund them) to support the imple- 
mentation of such professional-development 
classes or initiatives. The research enterprise 
thrives on an abundant reservoir of highly 
talented, highly motivated PhD students and 
recent graduates, and does not want to lose 
them to the business sector. 

Greater competition for PhD-trained 
talent from the private sector, however, would 
invariably raise salaries for entry-level PhD 
graduates. Furthermore, science-funding 
agencies rely on PhD programmes and indi- 
vidual advisers to provide the bulk of profes- 
sional development to their students. Many 
research faculty members lack sufficient 
industry or professional-development
experience to provide the sort of training 
that MBA students receive as a customary 
part of their degree. 

If a greater proportion of PhD graduates
were to transition into the business sector, 
however, it could create pressure on univer- 
sities to provide such training and on fund- 
ing agencies to require it. 

In the meantime, this training gap can 
easily be filled by taking short courses or 
programmes in business skills. Many are 
available online as massive open online 
courses, and some offer certifications. 

Still, even more so than the paucity of 
professional-development programmes 
for PhD students, the greatest barrier to a
high-paying position in the business sector 
is your personal beliefs about what you are 
qualified to pursue. 

As someone with a PhD background, you probably view yourself foremost, even solely, as a research scientist. The broad set of valuable transferable skills that you developed while in graduate school go largely unrecognized and unarticulated within the academy. Most PhD graduates restrict their job searches to what they feel qualified to do, rather than exploring what they are capable of doing.

And as a PhD graduate, you may believe 
that academia is the only sector in which 
you can enjoy intellectual freedom and 
work on challenging problems. This is 
incorrect. The business world is full of 
delicious, complex, intellectually exciting 
problems. Their resolution can yield enor- 
mous value, both to those who solve the 
problems and to society. 

Some people are beginning to recognize 
this reality. Graduate students and postdocs 
are organizing their own professional- 
development programmes, sometimes 
recruiting business-school professors to 
help them (see Nature 485, 269-270; 2012). 

And more businesses, especially tech- 
nology-related companies, are either 
being launched by graduates of doctoral 
programmes or have a PhD holder on the 
founding team. 

As more PhD students and graduates 
learn about these opportunities from their 
brethren, we can expect further interest in, 
and greater pursuit of, business careers by 
doctorate holders. MBAs, beware. ■


Peter Fiske is chief executive of PAX Water Technologies in Richmond, California, and author of Put Your Science to Work (American Geophysical Union, 2001).


TURNING POINT 


Diversity ruling 


Lia Corrales, a postdoctoral researcher at the Massachusetts Institute of Technology (MIT) in Cambridge, is fighting for equality in astronomy, a field that has been plagued by allegations of harassment and low diversity. Last December, Corrales and her colleagues wrote an open letter to the US Supreme Court after some judges questioned whether a federal policy known as affirmative action has helped people from minority groups to become scientists.


What happened at the Supreme Court? 

The justices were debating the merits of affirm- 
ative action, which aims to protect against dis- 
crimination on the basis of race or gender, in 
higher education. Justice Antonin Scalia noted 
that some people contend that the policy does 
not benefit African Americans because they do 
better at a less-advanced, slower-track school. 
And the chief justice, John G. Roberts Jr, asked 
what “unique perspective” a minority student 
could bring to a physics class. 


Why did you write an open response? 

I was angry that the justices’ comments implied 
that affirmative action has no value, and knew 
that friends were upset, too. Astrophysicist 
Josh Tan, a colleague of mine, and I discussed 
writing a letter because, as physicists and 
astronomers who care about diversity, we're not 
doing our job if we don’t stand up for affirmative
action. In the end, five of us wrote it. I put
the letter online at eblur.github.io/scotus and 
mailed printed copies to the justices on the case. 
We gathered more than 2,400 signatures. 


Describe your pursuit of astronomy. 

As a kid, I loved black holes and thought 
Stephen Hawking was the coolest, so it seemed 
natural to study physics and astronomy. While 
I was an undergraduate at Harvey Mudd 
College in Claremont, California, roughly 
one-third of the students there — but probably 
half of my physics class — were women. And 
there were plenty of women at Columbia Uni- 
versity in New York, where I earned my PhD 
studying interstellar dust. But when I came to 
MIT, where only 3 or 4 of about 30 postdocs at 
my institute were women, it became apparent 
how few of ‘me’ there were.


When did you first work on diversity issues? 

As an undergraduate. The diversity office at 
Harvey Mudd College asked me to help start 
a branch of the US Society of Hispanic Profes- 
sional Engineers. But I also wanted to make a 
difference to aspiring university students, so 
we connected with a preparatory programme
for students who would be the first in their
family to attend university. I now realize that
I'm passionate about this issue because of my 
own Latin-American background and my 
angst about affirmative action. 


You are part of a Facebook group called Equity 
and Inclusion in Physics and Astronomy. Can 
you tell us more about it? 

I’m not a founder, but it started out as a 
‘Women in Astronomy’ group on Facebook, 
and evolved into something that was about 
more than just gender. There are many ways 
in which people experience discrimination, 
so now it’s more of an umbrella group to 
increase diversity. 


What else have you been involved with? 

Last summer, an Inclusive Astronomy 
conference was held at Vanderbilt University 
in Nashville, Tennessee — an institution where 
several faculty members have tackled diversity 
issues. It was an amazing event. There was so 
much acceptance that we must confront stereo- 
types and move beyond the ways in which we 
allow or contribute to discrimination. 


Are things beginning to change in astronomy? 
There is a critical mass of people who are 
talking about diversity and inclusivity. One 
recommendation from the conference was to 
stop the requirement for minimum test scores 
for admission to US graduate schools. This 
stance has already been adopted by the Ameri- 
can Astronomical Society — a step forward. 
There’s still a long way to go, but I feel more 
inspired and empowered than I have in a really 
long time. ■


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
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Ua SCIENCE FICTION 


THE MAN WITH THE SPIDER 


BY MOHAMAD ATIF SLIM 


C C want out.” 
“Tm sorry, sir?” Surely she had mis- 
heard. 


“T want to get out?” 

The receptionist seemed slightly 
alarmed. Her eyes stole to the intra- 
venous device on the man’s forearm. 

“Sir, are you ... sober?” She hesitated. 
“We do offer a wide range of reversal 
agents for all classes of pharma, guaran- 
teed not to compromise any subsequent 
pleasure —” 

“But I am sober!” Carusi was dismayed 
to hear the creeping hysteria in his voice. 
“Please. Id like to leave.” 

“Tm... not sure if that’s possible, sir” 

“I signed the contract, I know. But I’ve 
changed my mind. Do whatever you want 
with me, just not this. I can’t go through with 
this. I need to see my wife.” 

Her eyes again fell on Carusi’s arm. 

“Please. If youre not able to help, can you 
get someone who can?” 

“I... Certainly, sir. Please have a seat.” 

As she retreated into a back room, Carusi 
gazed anxiously out of the window. The sky 
was azure over plains of green, and a dis- 
tant bird soared above snow-flecked moun- 
tains. The hyper-realistic projections were 
designed to deceive; MARTYR was at least 
100 kilometres underground. 

Like many, Carusi had lapped up the end- 
less supply of stimulants at the beginning. 
Without sleep, you start to lose count of the 
days. Time becomes an abstraction of pleas- 
ure after pleasure after pleasure. For most, 
the hour of martyrdom approached with 
stealth: a brief, errant breeze, then quiet. 

But yesterday, Carusi decided to forgo the 
diafinil. Something gnawed at the back of 
his mind, a place no pharma could touch. 
For the first time in perhaps a week, he 
slept, only to wake up hours later, drenched 
in sweat. Colomen’s face swam across his 
vision, and he stifled a cry. 

That was when panic, that many-legged 
spider, took residence in the pit of his stomach. 

He needed to go home. 

How many days had passed since he 
enlisted? 

How many did he have left? 

What a fool he was! 

He had wanted to lower her mother’s 
medical cap — palliative, then assisted 
decline, after 60. Sell the excess. It was the 
only way they could afford a reproductive 
licence before June. They had already missed 
the sterilization lottery two years in a row. 
Three would be too lucky. 

And Ma didn’t need to know.

But Colomen was stubborn. “We’ll work,
Carusi. We’ll earn our licence.”

The arguments were long and many, but 
the last was the worst. Carusi had stormed 
out. How could Colomen insist on choosing 
her mother over him? Over their future? 

He might have left for hours. He might 
have got drunk and seen the ad for MARTYR 
in the subway: “3D-printing for food will 
saturate in 2 years — for the sake of the chil- 
dren, enlist as a Martyr!” 

And he might have, in the heat of the 
moment, thought it was a good idea. 

Martyrdom! For the sake of the child he 
would never have. 

The registrar didn't ask questions, even 
though Carusi was slurring his words. 
Martyrdom was a right; in exchange, state- 
sanctioned pleasure for 13 days — bound- 
less, limitless pleasure. And they distort the 
perception of time too, he’d heard. Make
13 days feel like for ever. 

But he’d also heard other things. Rumour
orbits MARTYR like flies round a carcass. 

The government calls it the solution to 
over-population: the old for the new, the 
infirm for the healthy. Cattle are long dead, 
and oil wells dry; scientists say once the 
population reaches critical mass, anarchy 
will prevail — the children will suffer. 

Yet, others call it foul. Class-based eugen- 
ics, they say. Institutionalized nihilism. 
They say only the proletariat enlist because 
society offers them nothing else. That 
MARTYR cannot save a corrupt world. That
in secret places around the Metropolis, the 
bourgeoisie scramble to bid at auctions for 
licences the Martyrs forfeit. 
But Carusi didn't care about politics, or 
much about the world. He had enlisted 
because he was angry, petty and spiteful. 

He knew that now. 

“Mr Onnodig?” 

Carusi turned with a start. A well- 
dressed man beckoned him into the back 
room, an expression of grave concern on
his face.

“I am very sorry, sir,” he began, once they
were seated. “I empathize with your predica- 
ment, but MARTYR cannot allow for any 
process reversal. The cogs turn one way.” 

Inside, the spider gave a malevolent hiss. 
“But I was unwell!” Carusi pleaded, then 
added, shamefully: “And inebriated. I was 
angry with my wife.” 

“I would like to help, but we have no legal
provisions.” He handed Carusi a holotablet.
“Your contract, sir. Page 254. Clause 537. An
Acute Stress Reaction is not tenable grounds
for denying a Citizen his right to Martyrdom.”

“But what about my rights now?” 

“Martyrs have basic rights, but their 
right to Martyrdom supersedes all else.” He 
paused. “You must understand — MARTYR 
only works if everyone plays their part. The 
government compensates the citizen with a 
commensurate entitlement to pleasure. In 
turn, you surrender your licence to live.” 

Carusi felt the spider climb into the hol- 
low of his chest, lodging itself beneath his 
ribs. Its jaws chittered angrily. 

He stared at the opposite wall, unable to 
find comfort in the gaunt figure reflected in 
its polished wooden panels. 

The girl from earlier approached him with 
a syringe, her gestures gauche. 

“Lotazolam, sir. It can help with anxiety.”

Carusi ignored her. “Can I call my wife?” 

The man shifted uncomfortably in his seat. 
“This is very... unconventional for us. No 
one’s ever asked to leave.” The man seemed 
to debate internally, glancing at the girl. “But 
we will spare you five minutes, no more.” 

They left the man with the spider alone in 
the room. A minute later, his wife's face flick- 
ered onto the holotablet, and Carusi broke 
down into sobs.